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<S (S4) Title: IDENTIFICATION AND CHARACTERIZATION OF AN ANTHOCYANW MUTANT (ANT1) IN TOMATO 
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O (57) Abstract: The present invention is directed to a novel plant phenotype, designated Arahocyanin I {ANTI), a nucleic acid se- 
^ quence expressed in plants demonstrating the ANT I phenotype and the corresponding amino acid sequence. A!so provided are plant 
r cells that exhibit modified ANTI expression. 
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IDENTIFICATION AND CHARACTERIZATION OF AN 
ANTHOCYANIN MUTANT (ANT1) IN TOMATO 



REFERENCE TO RELATED APPLICATIONS 
[0001] This Application claims priority to U.S. provisional patent application serial No. 
60/244,685, filed October 30, 2000, the contents of which are hereby incorporated in 
its entirety. 

FIELD OF THE INVENTION 
[0002] The present invention relates to a plant phenotype, designated Anthocyanin 1 
(ANT1)> together with DNA and polypeptide sequences associated with the same. 

BACKGROUND OF THE INVENTION 
[0003] The traditional methods for gene discovery, including chemical mutagenesis, 
irradiation and T-DNA insertion, used to screen loss of function mutants have 
limitations. Mutagenic methods such as these rarely identify genes that are redundant 
in the genome, and gene characterization is time-consuming and laborious. 
[0004] Activation tagging is a method by which genes are randomly and strongly up- 
regulated on a genome-wide scale, after which specific phenotypes are screened for 
and selected. Isolation of mutants by activation tagging has been reported (Hayashi et 
a/., 1992). An activation T-DNA tagging construct was used to activate genes in 
tobacco cell culture allowing the cells to grow in the absence of plant growth 
hormones (Walden et a/., 1994). Genes have been isolated from plant genomic 
sequences flanking the T-DNA tag and putatively assigned to plant growth hormone 
responses. (See, e.g., Miklashevichs et al. 1997, Harling et a/., 1997; Walden et. ai., 
1994; and Schell et al. f 1998, which discusses related studies.) 
[0005] The first gene characterized in Arabidopsis using activation tagging was a gene 
encoding the histone kinase involved in the cytokinin signal transduction pathway. 
The gene sequence was isolated from plant genomic DNA by plasmid rescue and the 
role of the gene, GK71, in cytokinin responses in plants was confirmed by re- 
introduction into Arabidopsis (Kakimoto, 1996). This was followed by reports of 
several dominant mutants such as TINY, LHY and SHI using a similar approach along 
with the Ds transposable element (Wilson et al, 1996, Schaffer et a/., 1998, Fridborg 
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et aL, 1999). In a more recent report, activation T-DNA tagging and screening plants 
for an early flowering phenotype led to the isolation of the FT gene (Kardailsky et of.. 
1999). 

[0006] The potential application of activation tagging as a high through put technology for 
gene discovery has been demonstrated based on screening of several dominant mutant 
genes involved in photoreceptor, brassinosteroid, gibberellin and flowering signal 
pathways, as well as disease resistance. (See, e.g., Weigel et al. % 2000, Chris tensen et 
aL, 1998; Kardailsky et at, 1999). 

[0007] Arabidopsis has been widely used as a model for plant improvement for plants 
such as Brassica species having a siliques type of fruit However, Arabidopsis does 
not serve as a model for plants having a fleshy fruit 

[0008] A method for identifying and characterizing genes based on modified gene 

expression in fruit-bearing plants is described in PCT publication WO0053794. Dwarf 
varieties of fruit-bearing plants, particularly dwarf varieties of tomato, are useful in 
the overexpression of one or more native plant genes and in correlating that 
overexpression with a particular phenotype. 

[0009] Dwarf tomatoes are characterized by their short internodes, which give plants a 
compact appearance. The miniature Lycopersicon esculentum culttvar, Micro-Tom is 
a proportionally dwarfed plant that grows at high density (up to 1357 plants/m" 2 ), has a 
short life cycle (70-80 days from sowing to fruit ripening), and for which fruit size, 
and leaf size have been genetically reduced (Meissner et a/., 1997; Scott and 
Harbaugh, 1989). In addition, Micro-Tom has been shown to be resistant to a number 
of diseases and can be transformed at frequencies of up to 80% through 
Agrobacterium-medmted transformation of cotyledons (Meissner et a/., 1997). Similar 
to Micro-Tom, Florida Pfetite (Fla. Agr. Expt Sta. Circ. S-285), Tiny Tim and Small Fry 
are dwarf varieties of tomato which have a short life cycle, and for which fruit size, and 
leaf size have been genetically reduced. 

[0010] Efforts are underway in industry and academia to develop a means to identify 
genes associated with particular plant traits or characteristics in order to develop 
improved plants having such traits. The present invention provides a plant phenotype 
associated with modified expression of a native plant gene. 

[0011] In an activation tagging screen in Micro-Tom, we identified a gene involved in 
pigment production. Anthocyanins are pigments that are responsible for many cf the 
red and blue colors in plants. The genetic basis of anthocyanin biosynthesis has been 
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well characterized in corn* Petunia, and Antirrhinium (Dooner et al, 1991; Jayaram and 
Peterson, 1990; Quattrocchio Fetal., 1999). 

SUMMARY OF THE INVENTION 

[0012] The invention provides nucleic acid and amino acid sequences associated with the 

Anthocyanin 1 (*ANT1 ") phenotype in plants, presented as modified leaf, flower or fruit color. 

[0013] In one aspect, the invention provides one or more isolated ANT1 nucleic acid 
sequences comprising a nucleic acid sequence that encodes or is complementary to a 
sequence that encodes an ANTJ polypeptide having at least 70%, 80%, 90% or more 
sequence identity to the amino acid sequence presented as SEQ ID NO:2. 

[0014] In another aspect, the polynucleotide comprises a nucleic acid sequence that 
hybridizes, under high, medium, or low stringency conditions to the nucleic acid 
sequence, or fragment thereof, presented as SEQ ID NO: 1, or the complement thereof. 

[0015] In a related aspect, expression of one or more of such ANTJ polynucleotides in a 
plant is associated with the ANT1 phenotype. 

[0016] The invention further provides plant transformation vectors, plant cells, plant parts 
and plants comprising an ANTJ nucleic acid sequence. 

[0017] Expression of such an ANTJ nucleic acid sequence in a plant is associated with the 
ANTJ phenotype, presented as a modified leaf, flower or fruit color phenotype. 

[0018] The expression of an ANTJ nucleic acid sequence may be modified in ornamental 
plants, fruit and vegetable-producing plants, grain-producing plants, oil-producing 
plants and nut-producing plants, as well as other crop plants, resulting in the ANT1 
phenotype. 

[0019] In a further aspect the invention provides a method of modifying the ANTJ 
phenotype in a plant by introducing an ANTJ nucleic acid sequence into plant 
progenitor cells and growing the cells to produce a transgenic plant. 

DETAILED DESCRIPTION OF THE INVENTION 

Definitions 

[0020] Unless otherwise indicated, all technical and scientific terms used herein have the 
same meaning as they would to one skilled in the art of the present invention. 
Practitioners are particularly directed to Sambrook et al. 9 1989, and Ausubel FM et al t 
1993, for definitions and terms of the art. It is to be understood that this invention is 
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not limited to the particular methodology, protocols, and reagents described, as these 
may vary. 

[0021] All publications cited herein, and listed below immediately after the examples, are 
expressly incorporated herein by reference for the purpose of describing and disclosing 
compositions and methodologies that might be used in connection with the invention. 
All cited patents, patent publications, and sequence and other information in 
referenced websites are also incorporated by reference. 

[0022] As used herein, the term "vector" refers to a nucleic acid construct designed for 
transfer between different host cells. An "expression vector" refers to a vector that has 
the ability to incorporate and express heterologous DNA fragments in a foreign cell 
Many prokaryotic and eukaryotic expression vectors are commercially available. 
Selection of appropriate expression vectors is within the knowledge of those having 
skill in the art. 

[0023] A "heterologous" nucleic acid construct or sequence has a portion of the sequence 
which is not native to the plant cell in which it is expressed. Heterologous, with 
respect to a control sequence refers to a control sequence (ie. promoter or enhancer) 
that does not function in nature to regulate the same gene the expression of which it is 
currently regulating. Generally, heterologous nucleic acid sequences are not 
endogenous to the cell or part of the genome in which they are present, and have been 
added to the cell, by infection, transfection, microinjection, electroporation, or the like. 
A "heterologous" nucleic acid construct may contain a control sequence/DNA coding 
sequence combination that is the same as, or different from a control sequence/DNA 
coding sequence combination found in the native plant. 

[0024] As used herein, the term "gene" means the segment of DNA involved in producing 
a polypeptide chain, which may or may not include regions preceding and following 
the coding region, e.g. 5' untranslated (5 1 UTR) or "leader" sequences and 3* UTR or 
"trailer" sequences, as well as intervening sequences (introns) between individual 
coding segments (exons). 

[0025] As used herein, "percent (%) sequence identity" with respect to a subject sequence, 
or a specified portion of a subject sequence, is defined as the percentage of nucleotides 
or amino acids in the candidate derivative sequence identical with the nucleotides or 
amino acids in the subject sequence (or specified portion thereof)* after aligning the 
sequences and introducing gaps, if necessary to achieve the maximum percent 
sequence identity, as generated by die program WU-BLAST-2.0al9 (Altschul et a/., J. 
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Mol. Biol. (1997) 215:403-410; blastwustl.edu/blast/README.html website) with all 
the search parameters set to default values. The HSP S and HSP S2 parameters are 
dynamic values and are established by the program itself depending upon the 
composition of the particular sequence and composition of the particular database 
against which the sequence of interest is being searched. A % identity value is 
determined by the number of matching identical nucleotides or amino acids divided by 
the sequence length for which the percent identity is being reported. "Percent (%) 
amino acid sequence similarity" is determined by doing the same calculation as for 
determining % amino acid sequence identity, but including conservative amino acid 
substitutions in addition to identical amino acids in the computation. 

[0026] The term "% homology" is used interchangeably herein with the term H % identity." 

[0027] A nucleic acid sequence is considered to be "selectively hybridizable" to a 
reference nucleic acid sequence if the two sequences specifically hybridize to one 
another under moderate to high stringency hybridization and wash conditions. 
Hybridization conditions are based on the melting temperature (Tm) of the nucleic 
acid binding complex or probe. For example, "maximum stringency" typically occurs 
at about Tm-5°C (5° below the Tm of the probe); "high stringency" at about 5-10° 
below the Tm; "intermediate stringency" at about 10-20° below the Tm of the probe; 
and "low stringency" at about 20-25° below the Tm. Functionally, maximum 
stringency conditions may be used to identify sequences having strict identity or near- 
strict identity with the hybridization probe; while high stringency conditions are used 
to identify sequences having about 80% or more sequence identity with the probe. 

[0028] Moderate and high stringency hybridization conditions are well known in the art 
(see, for example, Sambrook, et cd, 1989, Chapters 9 and 1 1, and in Ausubel, F.M., et 
a/., 1993, expressly incorporated by reference herein). An example of high stringency 
conditions includes hybridization at about 42°C in 50% formamide, 5X SSC, 5X 
Denhardt's solution, 0.5% SDS and 100 fig/ml denatured carrier DNA followed by 
washing two times in 2X SSC and 0.5% SDS at room temperature and two additional 
times in 0.1X SSC and 0.5% SDS at 42°C. 

[0029] As used herein, "recombinant" includes reference to a cell or vector, that has been 
modified by the introduction of a heterologous nucleic acid sequence or that the cell is 
derived from a cell so modified. Thus, for example, recombinant cells express genes 
that are not found in identical form within the native (non-recombinant) form of the 
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cell or express native genes that are otherwise abnormally expressed, under expressed 
or not expressed at all as a result of deliberate human intervention. 
[0030] As used herein, the terms "transformed", "stably transformed" or "transgenic" with 
reference to a plant cell means the plant cell has a non-native (heterologous) nucleic 
acid sequence integrated into its genome which is maintained through two or more 
generations. 

[0031] As used herein, the term "expression" refers to the process by which a polypeptide 
is produced based on the nucleic acid sequence of a gene. The process includes both 
transcription and translation. 

[0032] The term "introduced" in the context of inserting a nucleic acid sequence into a 
cell, means "transfection", or "transformation" or "transduction" and includes 
reference to the incorporation of a nucleic acid sequence into a eukaryotic or 
prokaryotic cell where the nucleic acid sequence may be incorporated into the genome 
of the cell (for example, chromosome, plasmid, plastid, or mitochondrial DNA), 
converted into an autonomous replicon, or transiently expressed (for example, 
transfected mRNA). 

[0033] As used herein, a "plant cell" refers to any cell derived from a plant, including cells 
from undifferentiated tissue (e.g., callus) as well as plant seeds, pollen, progagules and 
embryos. 

[0034] As used herein, the terms "native" and "wild-type" relative to a given plant trait or 

phenotype refers to the form in which that trait or phenotype is found in the same 

variety of plant in nature. 
[0035] As used herein, the term "modified" regarding a plant trait, refers to a change in the 

phenotype of a transgenic plant relative to a non-transgenic plant, as it is found in 

nature. 

[0036] As used herein, the term "T| " refers to the generation of plants from the seed of To 
plants. The Ti generation is the first set of transformed plants that can be selected by 
application of a selection agent, e.g., an antibiotic or herbicide, for which the 
transgenic plant contains the corresponding resistance gene. 

[0037] As used herein, the term "T 2 " refers to the generation of plants by self-fertilization 
of the flowers of Ti plants, previously selected as being transgenic. 

[0038] As used herein, the term "plant part" includes any plant organ or tissue including, 
without limitation, seeds, embryos, meristematic regions, callus tissue, !eaves a roots, 
shoots, gametophytes, sporophytes, pollen, and microspores. Plant cells can be 
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obtained from any plant organ or tissue and cultures prepared therefrom. The class of 
plants which can be used in the methods of the present invention is generally as broad 
as the class of higher plants amenable to transformation techniques, including both 
monocotyledenous and dicotyledenous plants. 

[0039] As used herein, "transgenic plant" includes reference to a plant that comprises 
within its genome a heterologous polynucleotide. Generally, the heterologous 
polynucleotide is stably integrated within the genome such that the polynucleotide is 
passed onto successive generations. The heterologous polynucleotide may be 
integrated into the genome alone or as part of a recombinant expression cassette. 
'Transgenic*' is used herein to include any cell, cell line, callus, tissue, plant part or 
plant, the genotype of which has been altered by the presence of heterologous nucleic 
acid including those transgenics initially so altered as well as those created by sexual 
crosses or asexual propagation from the initial transgenic. 

[0040] Thus a plant having within its cells a heterologous polynucleotide is referred to 
herein as a "transgenic plant*** The heterologous polynucleotide can be either stably 
integrated into the genome, or can be extra-chromosomal. Preferably, the 
polynucleotide of the present invention is stably integrated into the genome such that 
the polynucleotide is passed on to successive generations. The polynucleotide is 
integrated into the genome alone or as part of a recombinant expression cassette. 
"Transgenic" is used herein to include any cell, cell line, callus, tissue, plant part or 
plant, the genotype of which has been altered by the presence of heterologous nucleic 
acids including those transgenics initially so altered as well as those created by sexual 
crosses or asexual reproduction of the initial transgenics. 

[0041] A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 
containing the expression constructs have been introduced is considered 
"transformed*', "transfected", or "transgenic". A transgenic or transformed cell or 
plant also includes progeny of the cell or plant and progeny produced from a breeding 
program employing such a transgenic plant as a parent in a cross and exhibiting an 
altered phenotype resulting from the presence of a recombinant nucleic acid sequence. 
Hence, a plant of the invention will include any plant which has a cell containing a 
construct with introduced nucleic acid sequences, regardless of whether the sequence 
was introduced into the directly through transformation means or introduced by 
generational transfer from a progenitor cell which originally received the construct by 
direct transformation. 
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[0042] The terras "Anthocyanin J "and "AJV77", as used herein encompass native 

Anthocyanin 1 (ANT1) nucleic acid and amino acid sequences, homologues, variants 
and fragments thereof. 

[0043] An "isolated" ANT1 nucleic acid molecule is an ANT1 nucleic acid molecule that is 
identified and separated from at least one contaminant nucleic acid molecule with 
which it is ordinarily associated in the natural source of the ANT1 nucleic acid An 
isolated ANT1 nucleic acid molecule is other than in the form or setting in which it is 
found in nature. However, an isolated ANT1 nucleic acid molecule includes ANTJ 
nucleic acid molecules contained in cells that ordinarily express ANT I where, for 
example, the nucleic acid molecule is in a chromosomal location different from that of 
natural cells. 

[0044] As used herein, the term "mutant" with reference to a polynucleotide sequence or 
gene differs from the corresponding wild type polynucleotide sequence or gene either 
in terms of sequence or expression, where the difference contributes to a modified 
plant phenotype or trait Relative to a plant or plant line, the term "mutant" refers to a 
plant or plant line which has a modified plant phenotype or trait, where the modified 
phenotype or trait is associated with the modified expression of a wild type 
polynucleotide sequence or gene. 

[0045] Generally, a "variant" polynucleotide sequence encodes a "variant" amino acid 
sequence which is altered by one or more amino acids from the reference polypeptide 
sequence. The variant polynucleotide sequence may encode a variant amino acid 
sequence having "conservative" or "non-conservative 11 substitutions. Variant 
polynucleotides may also encode variant amino acid sequences having amino acid 
insertions or deletions, or both. 

[0046] As used herein, the term "phenotype" may be used interchangeably with the term 
"trait". The terms refer to a plant characteristic which is readily observable or 
measurable and results from the interaction of the genetic make-up of the plant with 
the environment in which it develops. Such a phenotype includes chemical changes in 
the plant make-up resulting from enhanced gene expression which may or may not 
result in morphological changes in the plant, but which are measurable using analytical 
techniques known to those of skill in the art. 

[0047] As used herein, the term "interesting phenotype" with reference to a plant produced 
by the methods described herein refers to a readily observable 
phenotype demonstrated by a Ti and/or subsequent generation plant, which is not 
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displayed by a plant that has not been so transformed (and/or is not the progeny of a 
plant that has been so transformed) and represents an improvement in the plant An 
"improvement" is a feature that may enhance the utility of a plant species or variety by 
providing the plant with a unique quality. By unique quality is meant a novel feature 
or a change to an existing feature of the plant species which is a quantitative change 
(increase or decrease) or a qualitative change in a given feature or trait. 



The Identified ANT1 Phenotype and Gene 

[0048] The gene and phenotype of this invention were identified in a screen using 

activation tagging. Activation tagging is a process by which a heterologous nucleic 
acid construct comprising a nucleic acid control sequence, e.g. an enhancer, is inserted 
into a plant genome. The enhancer sequences act to enhance transcription of a one or 
more native plant genes (See, e.g., Walden R, et al. f 1994; Weigel D et al. 2000). 

[0049] Briefly, a large number of tomato (Lycopenium esculentum) cv. Micro-Tom plants 
were transformed with a modified form of the activation tagging vector pSKIOlS 
(Weigel et oZ, 2000), which comprises a T-DNA {Le., the sequence derived from the Ti 
plasmid of Agrobacterium tumifaciens that are transferred to a plant cell host during 
Agrobacterium infection), an enhancer element and a selectable marker gene. The 
construct, pAG3202, is further described in the Examples. Following random 
insertion of pAG3202 into the genome of transformed plants, the enhancer element can 
result in up-regulation genes in the vicinity of the T-DNA insertion, generally within 
5-10 kilobase (kb) of the insertion. In the T t generation, plants were exposed to the 
selective agent in order to specifically recover those plants that expressed the 
selectable marker and therefore harbored insertions of the activation-tagging vector. 
Transformed plants were observed for interesting phenotypes, which are generally 
identified at the Tj, T 2 and/or T 3 generations. Interesting phenotypes may be identified 
based on morphology, a biochemical screen, herbicide tolerance testing, herbicide 
target identification, fungal or bacterial resistance testing, insect or nematode 
resistance testing, screening for stress tolerance, such as drought, salt or antibiotic 
tolerance, and output traits, such as oil, starch, pigment, or vitamin composition. 
Genomic sequence surrounding the T-DNA insertion is analyzed in order to identify 
genes responsible for the interesting phenotypes. Genes responsible for causing such 
phenotypes are identified as attractive targets for manipulate!! for agriculture, feed, 
ornamental plant, and/or pharmaceutical industries. 
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[0050] It will be appreciated that in most cases when a modified phenotype results from 
the enhanced expression of a tagged gene, the phenotype is dominant Id some cases, 
the enhanced expression of a given native plant gene or a fragment thereof may result 
in decreased expression or inacti vation of its homologue or another native plant gene, 
which results in the interesting phenotype. The T-DNA insertion may also result in 
disruption ("loss-of-function") of a native plant gene, in which case the phenotype is 
generally recessive. 

[0051] The present invention provides a modified leaf, flower or fruit color phenotype, 
identified in ACTTAG MicoTom lines that were observed at the callus stage as 
having purple color and purple shoots. Purple plants were derived from purple colored 
caulogenic callus in culture. The clonal plant lines (i.e., additional shoots originating 
from the same purple colored caulogenic callus or those multiplied from the first 
purple plant either in tissue culture or by cuttings in the greenhouse) were identified as 
having purple coloration on leaves, sepals and flowers. The plants were also observed 
to exhibit a modified fruit color described as a deeper red color relative to wild type 
Micro-Tom plants. The phenotype and associated gene have been designated 
Anihocyanin 1 ("ANTl"). 

[0052] The invention also provides a newly identified and isolated nucleic acid sequence 
that was identified by analysis of the genomic DNA sequence surrounding the T-DNA 
insertion correlating with the ANTJ phenotype. In particular, applicants have 
identified and characterized the open reading frame of the ANTJ gene, which is 
specifically overexpressed in plants having the ANT1 phenotype, and which is 
provided in SEQ ID NO: 1 . A detailed description of the isolation and characterization 
of ANT1 is set forth in the Examples. 

Compositions of the Invention 
ANT1 Nucleic adds 

[0053] The ANT1 gene may be used in the development of transgenic plants having a 
desired phenotype. This may be accomplished using the native ANT1 sequence, a 
variant ANT1 sequence or a homologue or fragment thereof. 

[0054] A ANT1 nucleic acid sequence of this invention may be a DNA or RNA sequence, 
derived from genomic DNA, cDNA or mRNA. The nucleic acid sequence may be 
cloned, for example, by isolating genomic DNA from an appropriate scurss, and 
amplifying and cloning the sequence of interest using PCR. Alternatively, nucleic acid 
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sequence may be synthesized, either completely or in part, especially where it is 
desirable to provide plant-preferred sequences. Thus, all or a portion of the desired 
structural gene (that portion of the gene which encodes a polypeptide or protein) may 
be synthesized using codons preferred by a selected host 

[0055] The invention provides a polynucleotide comprising a nucleic acid sequence which 
encodes or is complementary to a sequence which encodes an ANT1 polypeptide 
having the amino acid sequence presented in SEQ ID NO:2 and a polynucleotide 
sequence identical over its entire length to the ANT1 nucleic acid sequence presented 
SEQ ID NO: 1. Hie invention also provides the coding sequence for the mature ANT I 
polypeptide, a variant or fragment thereof, as well as the coding sequence for the 
mature polypeptide or a fragment thereof in a reading frame with other coding 
sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or 
prepro- protein sequence. 

[0056] A ANT1 polynucleotide can also include non-coding sequences, including for 
example, but not limited to, non-coding 5' and 3' sequences, such as the transcribed, 
untranslated sequences, termination signals, ribosome binding sites, sequences that 
stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that 
encodes additional amino acids. For example, a marker sequence can be included to 
facilitate the purification of the fused polypeptide. Polynucleotides of the present 
invention also include polynucleotides comprising a structural gene and the naturally 
associated sequences that control gene expression. 

[0057] When an isolated polynucleotide of the invention comprises an ANT1 nucleic acid 
sequence flanked by non- ANT1 nucleic acid sequence, the total length of the 
combined polynucleotide is typically less than 25 kb, and usually less than 20kb, or 15 
kb, and in some cases less than 10 kb, or 5 kb. 

[0058] In addition to the ANT1 nucleic acid and corresponding polypeptide sequences 
described herein, it is contemplated that ANT1 variants can be prepared. ANT J 
variants can be prepared by introducing appropriate nucleotide changes into the ANT1 
nucleic acid sequence; by synthesis of the desired ANT1 polypeptide or by altering the 
expression level of the ANTJ gene in plants. Those skilled in the art will appreciate 
that amino acid changes may alter post-trans lational processing of the ANT I 
polypeptide, such as changing the number or position of glycosylation sites or altering 
the membrane anchoring charactmsrlcs, 
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[0059] In one aspect, preferred ANTJ coding sequences include a polynucleotide 

comprising a nucleic acid sequence which encodes or is complementary to a sequence 
which encodes an ANTJ polypeptide having at least 50%. 60%, 70%, 75%, 80%, 85%, 
90%, 95% or more sequence identity to the amino acid sequence presented in SEQ ID 
NO:2. 

[0060] In another aspect, preferred variants include an ANTJ polynucleotide sequence that 
is at least 50% to 60% identical over its entire length to the ANTJ nucleic acid 
sequence presented as SEQ ID NO:l, and nucleic acid sequences that are 
complementary to such an ANTJ sequence. More preferable are ANTJ polynucleotide 
sequences comprise a region having at least 70%, 80%, 85%, 90% or 95% or more 
sequence identity to the ANTJ sequence presented as SEQ ID NO: 1 . 

[0061] In a related aspect, preferred variants include polynucleotides that are be 

"selectively hybridizable" to the ANTJ polynucleotide sequence presented as SEQ ID 
NO:l. 

[0062] Sequence variants also include nucleic acid molecules that encode the same 

polypeptide as encoded by the ANTJ polynucleotide sequence described herein. Thus, 
where the coding frame of an identified nucleic acid molecules is known, for example by 
homology to known genes or by extension of the sequence, it is appreciated that as a 
result of the degeneracy of the genetic code, a number of coding sequences can be 
produced. For example, the triplet CGT encodes the amino acid arginine. Arginineis 
alternatively encoded by CGA, CGC, CGG, AGA, and AGG. Therefore it is 
appreciated that such substitutions in the coding region foil within the sequence variants 
that are covered by the present invention. Any and all of these sequence variants can be 
utilized in the same way as described herein for the identified ANTJ parent sequence, 
SEQ ID NO: 1. 

[0063] It is further appreciated that such sequence variants may or may not selectively 
hybridize to the parent sequence. This would be possible, for example, when the 
sequence variant includes a different codon for each of the amino acids encoded by the 
parent nucleotide. Such variants are, nonetheless, specifically contemplated and 
encompassed by the present invention. In accordance with the present invention, also 
encompassed are sequences that at least 70% identical to such degeneracy-derived 
sequence variants. 

[0064] Although ANT1 nucleotide sequence variants arc preferably capable of hybridizing 
to the nucleotide sequences recited herein under conditions of moderately high or high 
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stringency, there are, in some situations, advantages to using variants based on the 
degeneracy of the code, as described above. For example* codons may be selected to 
increase the rate at which expression of the peptide occurs in a particular prokaryotic or 
eukaryotic organism, in accordance with the optimum codon usage dictated by the 
particular host organism. Alternatively, it may be desirable to produce RNA having 
longer half lives than the mRNA produced by the recited sequences. 

[0065] Variations in the native full-length ANTJ nucleic acid sequence described herein, 
may be made, for example, using any of the techniques and guidelines for conservative 
and non-conservative mutations, as generally known in the art, oligonucleotide- 
mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site- 
directed mutagenesis (Kunkel TA et aL f 1991); cassette mutagenesis (Crameri Aetata 
1995); restriction selection mutagenesis (Haught Cetal., 1994), or other known 
techniques can be performed on the cloned DNA to produce nucleic acid sequences 
encoding ANTJ variants. 

[0066] It is contemplated that the gene sequences associated with the ANTJ phenotype 
may be synthesized, either completely or in part, especially where it is desirable to 
provide host-preferred sequences. Thus, all or a portion of the desired structural gene 
(that portion of the gene which encodes the protein) may be synthesized using codons 
preferred by a selected host. Host-preferred codons may be determined, for example, 
from the codons used most frequently in the proteins expressed in a desired host 
species. 

[0067] It is preferred that an ANT1 polynucleotide encodes an ANTJ polypeptide that 
retains substantially the same biological function or activity as the mature ANTJ 
polypeptide encoded by the polynucleotide set forth as SEQ ID NO: 1 (i.e. results in an 
ANTJ phenotype when overexpressed in a plant). 

[0068] Variants also include fragments of the ANTJ polynucleotide of the invention, 
which can be used to synthesize a full-length ANTJ polynucleotide. Preferred 
embodiments include polynucleotides encoding polypeptide variants wherein 5 to 10, 
1 to 5, 1 to 3, 2, 1 or no amino acid residues of an ANTJ polypeptide sequence of the 
invention are substituted, added or deleted, in any combination. Particularly preferred 
are substitutions, additions, and deletions that are silent such that they do not alter the 
properties or activities of the polynucleotide or polypeptide. 

[0069] A nucleotide sequence encoding aa ANTJ polypeptide car* also be used lo 

construct hybridization probes for further genetic analysis. Screening of a cDNA or 
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genomic library with the selected probe may be conducted using standard procedures, 
such as described in Sambrook et a/., 1989). Hybridization conditions, including 
moderate stringency and high stringency, are provided in Sambrook et ai., supra. 
[0070] The probes or portions thereof may also be employed in PCR techniques to 
generate a pool of sequences for identification of closely related ANT1 sequences. 
When ANTl sequences are intended for use as probes, a particular portion of an ANTl 
encoding sequence, for example a highly conserved portion of the coding sequence 
may be used. 

[0071] For example, an ANTl nucleotide sequence may be used as a hybridization probe 
for a cDNA library to isolate genes, for example, those encoding naturally-occurring 
variants of ANTl from other plant species, which have a desired level of sequence 
identity to the ANTl nucleotide sequence disclosed in SEQ ID NO:l . Exemplary 
probes have a length of about 20 to about 50 bases. 

[0072] In another exemplary approach, a nucleic acid encoding an ANTl polypeptide may 
be obtained by screening selected cDNA or genomic libraries using the deduced amino 
acid sequence disclosed herein, and, if necessary, using conventional primer extension 
procedures as described in Sambrook et aL t supra, to detect ANTl precursors and 
processing intermediates of raRNA that may not have been reverse-transcribed into 
cDNA. 

[0073] As discussed above, nucleic acid sequences of this invention may include genomic, 
cDNA or mRNA sequence. By "encoding" is meant that the sequence corresponds to 
a particular amino acid sequence either in a sense or anti-sense orientation. By 
"extrachromosomal" is meant that the sequence is outside of the plant genome of 
which it is naturally associated. By "recombinant** is meant that the sequence contains 
a genetically engineered modification through manipulation via mutagenesis, 
restriction enzymes, and the like. 

[0074] Once the desired form of an ANTl nucleic acid sequence, homologue, variant or 
fragment thereof, is obtained, it may be modified in a variety of ways. Where the 
sequence involves non-coding flanking regions, the flanking regions may be subjected 
to resection, mutagenesis, etc. Thus, transitions, trans versions, deletions, and 
insertions may be performed on the naturally occurring sequence. 

[0075] With or without such modification, the desired form of the ANTl nucleic acid 
sequence, hcinclogue, variant or fragment thereof, may be incorporated into a plant 
expression vector for transformation of plant cells. 
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ANTl Polypeptides 

[0076] In one preferred embodiment, the invention provides an ANTl polypeptide, having a 
native mature or full-length ANTl polypeptide sequence comprising the sequence 
presented in SEQ ID NO:2. A ANTl polypeptide of the invention can be the mature 
ANTl polypeptide, part of a fusion protein or a fragment or variant of the ANTl 
polypeptide sequence presented in SEQ ID NO:2. 

[0077] Ordinarily, an ANTl polypeptide of the invention has at least 50% to 60% identity 
to an ANTl amino acid sequence over its entire length. More preferable are ANTl 
polypeptide sequences that comprise a region having at least 70%, 80%, 85%, 90% or 
95% or more sequence identity to the ANTl polypeptide sequence of SEQ ID NO:2. 

[0078] Fragments and variants of the ANTl polypeptide sequence of SEQ ID NO:2, are 
also considered to be a part of the invention. A fragment is a variant polypeptide that 
has an amino acid sequence that is entirely the same as part but not all of the amino 
acid sequence of the previously described polypeptides. Exemplary fragments 
comprises at least 10, 20, 30, 40, 50, 75, or 100 contiguous amino acids of SEQ ID 
NO:2. The fragments can be <4 free-standing" or comprised within a larger polypeptide 
of which the fragment forms a part or a region, most preferably as a single continuous 
region. Preferred fragments are biologically active fragments, which are those 
fragments that mediate activities of the polypeptides of the invention, including those 
with similar activity or improved activity or with a decreased activity. Also included 
are those fragments that antigenic or immunogenic in an animal, particularly a human. 

[0079] ANTl polypeptides of the invention also include polypeptides that vary from the 
ANTl polypeptide sequence of SEQ ID NO:2. These variants may be substitutional, 
insertional or deletional variants. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be 
selected which have modified characteristics as further described below. 

[0080] A "substitution" results from the replacement of one or more nucleotides or amino 
acids by different nucleotides or amino acids, respectively. 

[0081] An "insertion" or "addition" is that change in a nucleotide or amino acid sequence 
which has resulted in die addition of one or more nucleotides or amino acid residues, 
respectively, as compared to the naturally occurring sequence. 

[0082] A "deletion" is defined as a change in either nucleotide or amino acid sequence in 
which one or mors nucleotides cr amine acid residues, respectively, ars absent 
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[0083] Amino acid substitutions are typically of single residues; insertions usually will be 
on the order of from about 1 to 20 amino acids, although considerably larger insertions 
may be tolerated Deletions range from about 1 to about 20 residues, although in some 
cases deletions may be much larger. 

[0084] Substitutions, deletions, insertions or any combination thereof may be used to arrive 
at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. However, larger changes may be tolerated in 
certain circumstances. 

[0085] Amino acid substitutions can be the result of replacing one amino acid with another 
amino acid having similar structural and/or chemical properties, such as the replacement 
of a leucine with a serine, Le. t conservative amino acid replacements. Insertions or 
deletions may optionally be in the range of 1 to 5 amino acids. 

[0086] Substitutions are generally made in accordance with known "conservative 

substitutions' 1 . A "conservative substitution" refers to the substitution of an amino acid 
in one class by an amino acid in the same class, where a class is defined by common 
physicochemical amino acid side chain properties and high substitution frequencies in 
homologous proteins found in nature (as determined, e.g. 9 by a standard Dayhoff 
frequency exchange matrix or BLOSUM matrix). (See generally, Doolittle, R J 7 ., 
1986.) 

[0087] A "non-conservative substitution" refers to the substitution of an amino acid in one 
class with an amino acid from another class. 

[0088] ANT1 polypeptide variants typically exhibit the same qualitative biological activity 
as the naturally occurring analogue, although variants also are selected to modify the 
characteristics of the ANT1 polypeptide, as needed. For example, glycosylation sites, 
and more particularly one or more Olinked or N-linked glycosylation sites may be 
altered or removed. Those skilled in the art will appreciate that amino acid changes may 
alter post-translational processes of the ANT1 polypeptide, such as changing the number 
or position of glycosylation sites or altering the membrane anchoring characteristics. 

[0089] The variations can be made using methods known in the art such as oligonucleotide- 
mediated (site-directed) mutagenesis, alanine scanning, and PCR mutagenesis. Site- 
directed mutagenesis [Carter et al 9 1986; Zoller et a/., 1987], cassette mutagenesis 
[Wells et al., 1985], restriction selection mutagenesis [Wells et a/., 1986] or other known 
techniques can be performed o« the cloned DNA to produce ths ANT1 polypeptide 
encoding variant DNA. 



16 



WO 02/055658 PCT/DS01/50638 

[0090] Also included within the definition of ANT1 polypeptides are other related ANT1 
polypeptides. Thus, probe or degenerate PCR primer sequences may be used to find 
other related polypeptides. Useful probe or prima: sequences may be designed to all or 
part of the ANT1 polypeptide sequence, or to sequences outside the coding region. As is 
generally known in the art, preferred PCR primers are from about 1 5 to about 35 
nucleotides in length, with from about 20 to about 30 being preferred, and may contain 
inosine as needed. The conditions for the PCR reaction are generally known in the art 

[0091] Covalent modifications of ANT] polypeptides are also included within the scope of 
this invention. For example, die invention provides ANT1 polypeptides that are a 
mature protein and may comprise additional amino or carboxyl-terminal amino acids, 
or amino acids within the mature polypeptide (for example, when the mature form of 
the protein has more than one polypeptide chain). Such sequences can, for example, 
play a role in the processing of a protein from a precursor to a mature form, allow 
protein transport, shorten or lengthen protein half-life, or facilitate manipulation of the 
protein in assays or production. It is contemplated that cellular enzymes can be used 
to remove any additional amino acids from the mature protein. [See, e.g., Creighton, 
TE, 1983]. 

[0092] In a preferred embodiment, overexpression of an ANT1 polypeptide or variant 
thereof is associated with the ANT1 phenotype. 



Antibodies 

[0093] The present invention further provides anti-AATi polypeptide antibodies. The 
antibodies may be polyclonal, monoclonal, humanized, bispecific or heteroconjugate 
antibodies. 

[0094] Methods of preparing polyclonal antibodies are known to the skilled artisan. Such 
polyclonal antibodies can be produced in a mammal, for example, following one or more 
injections of an immunizing agent, and preferably, an adjuvant. Typically, the 
immunizing agent and/or adjuvant will be injected into the mammal by a series of 
subcutaneous or intraperitoneal injections. The immunizing agent may include an ANT1 
polypeptide or a fusion protein thereof. It may be useful to conjugate the antigen to a 
protein known to be immunogenic in the mammal being immunized. The immunization 
protocol may be determined by one skilled in die art based on standard protocols or by 
routine experimentation, 
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[0095] Alternatively, the mh-ANTl polypeptide antibodies may be monoclonal antibodies. 
Monoclonal antibodies may be produced by hybridomas, wherein a mouse, hamster, or 
other appropriate host animal, is immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent [Kohler etal. y 1975]. Monoclonal antibodies may also be 
made by recombinant DNA methods, such as those described in U.S. Patent No. 
4,816,567. 

[0096] The diiti-ANTl polypeptide antibodies of the invention may further comprise 

humanized antibodies or human antibodies. The term "humanized antibody" refers to 
humanized forms of non-human (e.g., murine) antibodies that are chimeric antibodies, 
immunoglobulin chains or fragments thereof (such as Fv v Fab, Fab', F(ab') 2 or other 
antigen-binding partial sequences of antibodies) which contain some portion of the 
sequence derived from non-human antibody. Methods for humanizing non-human 
antibodies are well known in the art, as further detailed in Jones et aL 9 1986; Riechmann 
et a/., 1988; and Verhoeyen et a/., 1988. Methods for producing human antibodies are 
also known in the ait. See, e.g., Jakobovits, A, et al. f 1995; Jakobovits, A, 1995. 

[0097] In one exemplary approach, anri-AJV77 polyclonal antibodies are used for gene 

isolation. Western blot analysis may be conducted to determine that ANT1 or a related 
protein is present in a crude extract of a particular plant species. When reactivity is 
observed, genes encoding the related protein may be isolated by screening expression 
libraries representing the particular plant species. Expression libraries can be 
constructed in a variety of commercially available vectors, including lambda gtl 1, as 
described in Sambrook, et al. , 1989. 

Utility Of the ANT1 Phenotype and Gene 
[0098] From the foregoing, it can be appreciated that the ANT1 nucleotide sequence, 
protein sequence and phenotype find utility in modulated expression of the ANT1 
protein and the development of non-native phenotypes associated with such modulated 
expression. 

[0099] The ANTI phenotype has features which distinguish the mutant from wild type 

plants, including modified leaf color, modified flower color and modified fruit color. 
[00100] Anthocyanins are known to contribute to leaf color, flower color and fruit color. 

Anthocyanins are a group of water-soluble fiavonoids that impart pink to purpie color 
to leaves and other organs (Harbone et al., 1988). Anthocyanins have been associated 
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with many important physiological and developmental functions in the plants, 
including, but not limited to: (1) modification of the quantity and quality of captured 
light (Barker et al., 1977); (2) protection from the effects of UV-B radiation (Burger 
and Edwards, 1996 and Klaper et al., 1996); (3) defense against herbivores (Coley and 
Kusar, 1996); (4) protection from photoinhibition (Gould et al., 1995 and Dodd et al., 
1998); and (5) scavenging of reactive oxygen intermediates in stressful environments 
(Furuta et al., 1995; Sherwin et al., 1998; and Yamasaki 1997). The anthocyanins 
have demonstrated anti-oxidant activity, suggesting a role in protecting against cancer, 
cardiovascular and liver diseases (Kamei et al., 1993; Suda et al., 1997; and Wang et 
al., 2000). See also the websites at www.psIgroup.com/dg/39fb2.htm, 
www.wellweb.com/nutri/phytochemicals.htm, and 
www.nal.usda.gov/ttitftekt^ 

[00101] Considering this, the ANT1 phenotype described herein is not only colorful and 
hence finds utility in enhancement of the decorative value of ornamental plants, 
flowers and food, it also offers the potential for health benefits when \htANTl 
phenotype is expressed in plant varieties used as foods or food additives. 

[00102] In one aspect, the modified leaf, flower and fruit color of plants having the ANT1 
phenotype finds utility m the development of improved ornamental plants, fruits 
and/or cut flowers. 

[00103] In another aspect, the modified anthocyanin content in plants having the ANT1 
phenotype finds utility in plant-derived food and food additives. 

[00104] In another aspect, as further described in the Examples, the ANT J gene has utility 
as a transformation marker in genetically manipulated plants. 

[00105] In practicing the invention, the ANT1 phenotype and modified ANT1 expression is 
generally applicable to any type of plant, as further detailed below. 

[00106] The methods described herein are generally applicable to all plants. Although 
activation tagging and gene identification was carried out in tomato, following 
identification of a nucleic acid sequence and associated phenotype, the selected gene, a 
homologue, variant or fragment thereof, may be expressed in any type of plant. In one 
aspect, the invention is directed to fruit- and vegetable-bearing plants. 

[00107] The invention is generally applicable to plants which produce fleshy fruits; for example 
but not limited to, tomato (Lycoperskum)\ grape (Vitas)\ ); strawberry (Fragaria)\ raspberry, 
blackberry, loganberry {Rzbxz}; currants and gooseberry (Rites); blueberry, bilberry, 
whortleberry, cranberry (Vaccinium); kiwifruk and Chinese gooseberry (Actinida); apple 
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(Malus); pear (Pyms); melons (Cucumis jp.) members of the Prunus genera, e.g. plum, chery, 
nectarine and peach; sapota (Manilkara zapotilla); mango; avocado; apricot; peaches; cherries; 
pineapple; papaya; passion fruit; citrus; date palm; banana; plantain; and fig. 

[00108] Similarly, the invention is applicable to vegetable plants, including, but not limited to 
sugar beets, green beans, broccoli, brussei sprouts, cabbage, celery, chard, cucumbers, 
eggplants, peppers, pumpkins, rhubarb, winter squash, summer squash, zucchini, lettuce, 
radish, carrot, pea, potato, com, murraya and herbs. 

[00109] In a related aspect, the invention is directed to the cut flower industry, grain- 
producing plants, oil-producing plants and nut-producing plants, as well as other crops 
including, but not limited to, cotton (Gossypium), alfalfa (Medicago sativa), flax 
(Linum usitatissimum), tobacco (Nicotiana), turfgrass (Poaceae family), and other 
forage crops. 

[00110] References describing suitable transformation techniques for these and other plants 
are listed in Patent Application Serial No. 09/846,758. 

[00111] The skilled artisan will recognize that a wide variety of transformation techniques 
exist in the art, and new techniques are continually becoming available. Any technique 
that is suitable for the target host plant can be employed within the scope of the present 
invention. For example, the constructs can be introduced in a variety of forms 
including, but not limited to as a strand of ONA, in a plasmid, or in an artificial 
chromosome. The introduction of the constructs into the target plant cells can be 
accomplished by a variety of techniques, including, but not limited to Agrobacterium- 
mediated transformation, electroporation, microinjection, microprojectile 
bombardment calcium-phosphate-DNA co-precipitation or liposome-mediated 
transformation of a heterologous nucleic acid construct comprising the ANT J coding 
sequence. The transformation of the plant is preferably permanent, Le. by integration 
of the introduced expression constructs into the host plant genome, so that the 
introduced constructs are passed onto successive plant generations. 

[00112] In one embodiment, binary Ti-based vector systems may be used to transfer and 
confirm the association between enhanced expression of an identified gene with a 
particular plant trait or phenotype. Standard Agrobacterium binary vectors are known 
to those of skill in the art and many are commercially available, such as pBU21 
(Clontech Laboratories, Palo Alto, CA). 

[00113] The optimal procedure for transformation of plants with Agrobacterium vectors 
will vary with die type of plant being transformed. Exemplary methods for 
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Agrobacterium-mcdiated transformation include transformation of explants of 
hypocoty], shoot tip, stem or leaf tissue, derived from sterile seedlings and/or plantlets. 
Such transformed plants may be reproduced sexually, or by cell or tissue culture. 
Agrobacterium transformation has been previously described for a large number of 
different types of plants and methods for such transformation may be found in the 
scientific literature. 

[00114] Depending upon the intended use, a heterologous nucleic acid construct may be 
made which comprises a nucleic acid sequence associated with the ANTI phenotype, 
and which encodes the entire protein, or a biologically active portion thereof for 
transformation of plant cells and generation of transgenic plants. 

[00115] The expression of an ANTI nucleic acid sequence or a homologue, variant or 

fragment thereof may be carried out under the control of a constitutive, inducible or 
regulatable promoter. In some cases expression of the ANTI nucleic acid sequence or 
homologue, variant or fragment thereof may regulated in a developmental stage or 
tissue-associated or tissue-specific manner. Accordingly, expression of the nucleic 
acid coding sequences described herein may be regulated with respect to the level of 
expression, the tissue type(s) where expression takes place and/or developmental stage 
of expression leading to a wide spectrum of applications wherein the expression of an 
ANT! coding sequence is modulated in a plant 

[00116] Strong promoters with enhancers may result in a high level of expression. When a 
low level of basal activity is desired, a weak promoter may be a better choice. 
Expression of ANTI nucleic acid sequence or homologue, variant or fragment thereof 
may also be controlled at the level of transcription, by die use of cell type specific 
promoters or promoter elements in the plant expression vector. 

[00117] Numerous promoters useful for heterologous gene expression are available. 

Exemplary constitutive promoters include the raspberry E4 promoter (U.S. Patent Nos. 
5,783,393 and 5,783,394), the 35S CaMV (Jones JD et al, 1992), the CsVMV 
promoter (Verdaguer B et aL f 1998) and the melon actin promoter. Exemplary tissue- 
specific promoters include the tomato E4 and E8 promoters (U.S. Patent No. 
5,859,330) and the tomato 2AII gene promoter (Van Haaren MJJ et a/., 1993). 

[00118] When ANTI sequences are intended for use as probes, a particular portion of an 
ANTI encoding sequence, for example a highly conserved portion of a coding 
sequence may be used 
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[001 19] In yet another aspect, in some cases it may be desirable to inhibit the expression of 
endogenous ANTJ sequences in a host cell. Exemplary methods for practicing this 
aspect of the invention include, but are not limited to antisense suppression (Smith, et 
aL,1988); co-suppression (Napoli, ai.,1989); ribozymes (PCT Publication WO 
97/10328); and combinations of sense and antisense (Waterhouse, et aL, 1998). 
Methods for the suppression of endogenous sequences in a host cell typically employ 
the transcription or transcription and translation of at least a portion of the sequence to 
be suppressed. Such sequences may be homologous to coding as well as non-coding 
regions of the endogenous sequence. In some cases, it may be desirable to inhibit 
expression of the ANT1 nucleotide sequence. This may be accomplished using 
procedures generally employed by those of skill in the art together with the ANT1 
nucleotide sequence provided herein. 

[00120] Standard molecular and genetic tests may be performed to analyze the association 
between a cloned gene and an observed phenotype. A number of other techniques that 
are useful for determining (predicting or confirming) the function of a gene or gene 
product in plants are described below. 



DNA/RNA analysis 

[00121] DNA taken form a mutant plant may be sequenced to identify the mutation at the 
nucleotide level. The mutant phenotype may be rescued by overexpiessing the wild 
type (WT) gene. The stage- and tissue-specific gene expression patterns in mutant vs. 
WT lines, for instance, by in situ hybridization, may be determined. Analysis of the 
methylation status of the gene, especially flanking regulatory regions, may be 
performed. Other suitable techniques include overexpression, ectopic expression, 
expression in other plant species and gene knock-out (reverse genetics, targeted knock- 
out, viral induced gene silencing (VIGS, see Baulcombe D, 1999). 

[00122] In a preferred application, microanray analysis, also known as expression profiling 
or transcript profiling, is used to simultaneously measure differences or induced 
changes in the expression of many different genes. Techniques for microanay analysts 
are well known in the art (Schena M etal., Science (1995) 270:467-470; Baldwin D et 
al. % 1999; Dangond F, Physiol Genomics (2000) 2:53-58; van Hal NL et aL t J 
Biotechnol (2000) 78:271-280; Richmond T and Somerville S, Curr Opin Plant Biol 
(2000) 3: 108 1 16). Micrcarray analysis ef individual tagged lines may be carried o»% 
especially those from which genes have been isolated. Such analysis can identify 
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other genes that are coordinately regulated as a consequence of the overexpression of 
the gene of interest, which may help to place an unknown gene in a particular pathway. 

Gene Product Analysis 
[00123] Analysis of gene products may include recombinant protein expression, antisera 
production, immunolocalization, biochemical assays for catalytic or other activity, 
analysis of phosphorylation status, and analysis of interaction with other proteins via 
yeast two-hybrid assays. 

Pathway Analysis 

[00124] Pathway analysis may include placing a gene or gene product within a particular 
biochemical or signaling pathway based on its overexpression phenotype or by 
sequence homology with related genes. Alternatively, analysis may comprise genetic 
crosses with WT lines and other mutant lines (creating double mutants) to order the 
gene in a pathway, or determining the effect of a mutation on expression of 
downstream "reporter" genes in a pathway. 

Other Analyses 

[00125] Other analyses may be performed to determine or confirm the participation of the 

isolated gene and its product in a particular metabolic or signaling pathway, and to 

help determine gene function. 
[00126] All publications, patents and patent applications are herein expressly incorporated 

by reference in their entirety. 
[00127] While the invention has been described with reference to specific methods and 

embodiments, it will be appreciated that various modifications and changes may be 

made without departing from the invention. 

EXAMPLE 1 

Generation of Plants with an ANT1 Phenotype by Transformation with an Activation 
Tagging Construct 

I. Azrobacterium vector preparation. 
[00128] Mutants were generated using a modified version of the activation tagging 

"ACTTAG" vector, ?SKH» 5 (GenBank Metier [GT] 6537789; Weigel T) et of,, 
2000). This binary vector, called pAG3202, contains the following components: the 
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pSKI backbone; a 4X 35S enhancer consisting of four tandem repeats of the enhancer 
region from the CaMV 35S promoter including 4 AIul-EcoRV fragments in tandem, 
129 bp of CaMV sequence associated with each tandem AIul-EcoRV repeat, and an 
additional 7 bp repeated sequence that is not in the 35S enhancer region of the native 
CaMV genome; the nptll selectable marker under the control of a raspbeny E4 (RE4) 
promoter, an Agrobacterium gene 7 termination element located downstream of the 
nptll gene, adjacent the left border of the plasmid. The pAG3202 sequence is 
provided in SEQ ID NO: 3. 
[00129] Single colonies of Agrobacterium tumefaciens strains EHA 105/EHA 101/GV3101 
containing the binary plasmid pAG3202 were grown in MGL medium at pH S.4 
overnight and diluted to approximately 5x10* cells/ml with MGL or liquid plant co- 
cultivation medium. 

[00130] For long-term storage, PCR-positive colonies were grown in selective media, 

glycerol added to a final concentration of 30% and cultures quick frozen, then stored at 
-80°C. For the initiation of dense Agrobacterium cultures for plant transformation, 
stock cultures were grown in selective media, glycerol added to a final concentration 
of 30%, and a number of 20 yd aliquots quick frozen in liquid nitrogen and stored at - 
80°C. 

II. Transformation and Selection of Micro-Tom Mutants 
[00131] Activation tagging mutants were generated in tomato cv. Micro-Tom using 

Agrobacterium-mediM&d transformation. Sterile seedlings and plantiets were used as 
the source of explants. More specifically hypocotyl tissue was transformed. 
[00132] Seeds of (Lycopersium esculentum) were surface sterilized in 25% bleach with 
tween-20 for IS minutes and rinsed with sterile water before plating on seed 
germination medium (MS salts, Nitsch vitamins, 3% sucrose and 0.7% agar, pH 5.8), 
modified by the addition of auxin and/or cytokinins and giberrellic acid as necessary. 
The cultures were incubated at 24°C with a 16 hr photo period (50-60 ^mol.m'V 1 ). 
Seven to ten day old seedlings and one month old in vitro plants were used for 
hypocotyl explants. 

[00133] Hypocotyls were cut into 3-5 mm segments, then immersed in bacterial 

suspension, blotted on sterile filter paper and placed on co-cultivation medium. The 

explants were immersed in bacterial suspension, blotted on steriie fiiter paper and 
placed on co-cultivation medium (MS salts, LS vitamins, 3% sucrose, 0.1 mg/1 kinetin, 
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0.2 rag/I 2,4-D, 200 mgfl potassium acid phosphate, 50 pM acctosyringonc and 0.7% 
agar, pH 5.4) for 2-3 days. 

[00134] After two to three days of co-culture, the explains were transferred to shoot 

regeneration medium containing MS salts, Nitsch vitamins, 3% sucrose, 2 mg/1 zeatin, 
500 mg/1 carbenicillin, 200mg/L timetin and 0.7% agar at pH 5.8, supplemented with 
the antibiotic, kanamycin at 75 - 400 mg/1 in order to select for nptff-expressing 
transformants. The selection level of antibiotic was gradually raised over an 8-week 
period based on the tissue response. 

[00135] The explants were transferred to fresh medium every two weeks. Initiation of 

callus with signs of shoot initials was observed from 3*6 weeks depending on the type 
of explant Callusing and shoot regeneration was observed to continue over 
approximately 4 months after which the explant tissues decline. A purple callus was 
observed among the tissue growing on the selection medium. Regenerated shoots 
displayed a variety of color phenotypes and were entirely green, entirely purple, or mix of 
green and purple to various degrees. Green shoots of approximately 1 cm in size with 
distinct shoot meristems were excised from the callus and transferred to root induction 
medium containing MS salts, Nitsch vitamins, 3% sucrose, 1 mg/1 IBA, 50 mg/1 
kanamycin, 100 mg/l carbenicillin or lOOmg/L timetin and 0.7% agar, pH 5.8. The 
rooted plants were out-planted to soil in a Biosafety greenhouse. 

[00136] Plants were transported to greenhouse facilities, potted up in 3.5" pots tagged for 
plant identification. 

[00137] Transformants were observed at the callus stage and after T| plants were 

established in the greenhouse for phenotypic variations relative to wild-type Micro- 
Tom plants. To achieve this, several wild-type plants were kept in close proximity to 
the transgenic plants. Each plant was observed closely twice a week with observations 
noted and documented by photographs. 

[00138] Images of each pool of 8 plants were recorded using a Digital camera (DC-260), 
and morphology observations were made at about four weeks after planting. 

[00139] Eleven Micro-Tom lines were developed from the callus originally identified by its 
purple color and purple shoots at the caulogenic stage. The clonal plant lines were 
identified as having modified leaf color with a heavy purple cast on leaves, modified 
flower color characterized by purple striations on petals and sepals and flowers with a 
pmple cmt mixed with the norma! yellow ecler of the carslh. The plants vvsrs a!sc 
observed to exhibit a modified fruit color described as a deeper red color relative to 
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wild type Micro-Tom plants. The clonal plant lines (mutants) were designated 

Anthocyaninl CANTl"). 
[00140] The ANT1 mutant was identified from fewer than 2000 individual Micro-Tom 

tomato ACTTAG lines that were developed following tissue culture transformation 

with the binary plasmid pAG3202, and selection on kanaraycin-containing medium. 
[00141] Observations were made and photos taken of the clonal T| ANT1 plant lines that 

exhibited the ANT1 phenotype, designated H000001484, H000001624, H000001708, 

H000001709, H000001710, H00000171 1, H000001712, H000001713, H000001715, 

H000001716 and H000001717. 
[00142] Seeds were collected from Ti plants from line H000001624 and grown to generate 

T2 plants. From the 1 1 out of the 18 seeds that germinated, and 8 plants displayed 

purple coloration, confirming that ANT1 is a dominant mutation. 
[00143] The results indicated that ANT1 is a gain of function trait, expected from activation 

tagging based over-expression of a native gene. 



EXAMPLE 2 

Characterization of Plants That Exhibit the ANT1 Phenotype 
[00143] Micro-Tom genomic DNA was extracted from the H000001484 clone of the 

activation tagged mutant originally identified at the callus stage, in sufficient yield and 
quality for plasmid rescue of activation tagged plant lines using the procedure described 
below. Further analysis was performed using combined tissue derived from the 
H000001624, H000001708, H000001709, H000001710, H00000171 1, H000001712, 
H000001713, H000001715, H000001716 and H000001717 plant lines. 

I. MicroTom Tomato Genomic DNA Extraction 
[00144] Nucleon™ PhytoPure™ systems (Plant and fungal DNA extraction kits) from 
Araersham™ were used for extracting genomic DNA. Methods were essentially as 
follows: 

[00145] l.Og of fresh tissue from the H000001484 clone was ground in liquid nitrogen to 
yield a free flowing powder, then transferred to a 15 ml polypropylene centrifuge tube. 
4.6 ml of Reagent 1 from the Nucleon Phytopure kit was added with thorough mixing, 
followed by addition of 1.5 ml of Reagent 2 from the Nucleon Phytopure kit, with 

inversion until a homogeneous mixture was obtained. The mixture was incubated a: 
65°C in a shaking water bath for 10 minutes, and placed on ice for 20 minutes. The 
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samples were removed from the ice, 2 ml of - 20°C chloroform added, mixed and 
centrifuged at BOOg for 10 minutes. The supernatant was transferred into a fresh tube, 
2 ml cold chloroform, 20Q \d of Nucleon PhytoPure DNA extraction resin suspension 
added and the mixture shaken on a tilt shaker for 10 minutes at room temperature, then 
centrifuged at BOOg for 10 minutes. Without disturbing the Nucleon resin suspension 
layer, the upper DNA-containing phase was transferred into a fresh tube, centrifuged at 
9500 rpm for 30 minutes to clarify the transferred aqueous phase if the upper phase 
appeared cloudy, an equal volume of cold isopropanol added, and the tube gently 
inverted until DNA precipitated. It was then pelleted by centrifitgation, washed with 
cold 70% ethanol, pelleted again, and air-dried 
[00146] DNA was resuspended in TE buffer (10 mM Tris. HC1, pH 7.4, 1 mM EDTA), 

containing RNase, incubated at 55° C for IS minutes, further extracted 
phenol/chloroform, then chloroform, run on a 1% agarose gel to check the DNA 
Quality, the DNA concentration determined by a DNA fluorometer (Hoeffer DyNA 
Quant 200). 

[00147] DNA extracted from shoots of the H000001484 ANT1 clone at the caulogenic 

callus stage and from wild type plants was PCR-amplified using primers that amplify a 
35S enhancer sequence, and primers that amplify a region of the pBluescript vector 
sequence in pAG3202. Amplification using primers that span the 35S enhancer region 
resulted in a ladder of products, indicating that all four copies of the 35S enhancer 
were present. Amplification using primers to the pBluescript vector was done 
primarily to detect the T-DNA inserts) in transformed plants and has been optimized 
for the following conditions: annealing temp: 57*C, 30 cycles [94*C, 30sec; 57 # C, 1 
min; 72*C, 1 min] 1 cycle [72°C, 7 min]. 

[00148] The ACITAG™ line, H000001484 (AJV77), was confirmed as positive for the 

presence of 35S enhancer and pAG3202 vector sequences by PCR, and as positive for 
Southern hybridization verifying genomic integration of the ACTTAG DNA and 
showing the presence of a single T-DNA insertion in the clonal transgenic line. 



EL Plasmid Rescue 

[00149] Genomic DNA from the H000001484 clonal line was digested by the restriction 
enzymes used in Southern Hybridization. The restriction fragments were self-ligated 

and used to transform the E. coli cells. The plasmids that contained a full-length 
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pBluescript vector, 4X 35S enhancer, and a right border T-DNA flanking genomic 

DN A fragment were rescued 
[00150] More specifically, genomic DNA was digested with Hind m and Xho I under 

standard reaction conditions at 37°C overnight. 
[00151] The ligation reactions were set up containing the following and left at 16°C 



overnight: 

Digested Genomic DNA 40 \jd 

5X Ligation Buffer SO jxl 

Ugase(Gibcol, lU/fil) 10 |il 

ddH 2 Q 150 |il 



[00152] The ligated DNA precipitated, resuspended in ddHaO and used to transform E. coli 
SURE cells (Stratagene) via electroporation, with 10 pg of pUC18 plasmid as a 
control 

[00153] The transformation mixture was spread on two LB-plates containing 100 ng/ml 
ampicillin and incubated overnight at 37°C. Single colonies were picked from the 
plates and used to start a 5 ml LB-ampicillin broth culture from each colony by 
culturing overnight at 37°C. The plasmid was extracted from the culture and 
restriction digested to confirm the size of genomic insertion. 

IE. Sequencing Of Rescued Plasmids 

[00154] Sequencing was accomplished using a ABI Prism BigDye™ Terminator Cycle 

Sequencing Ready Reaction Kit (PE Applied Biosystem), AmpIiTaq DNA Polymerase 
(Perkin-EImer), an ABI Prism™ 310 Genetic Analyzer (Perkin-Elmer) and sequence 
analysis software, e.g. t Sequencer™ 3.1.1 or MacVector 6.5.3. Sequencing was done 
essentially according to manufacturers 1 protocols 

[00155] The left ends of plasmids rescued were sequenced across the right T-DNA border. 

[00156] The rescued sequence was subjected to analysis using the BLAST sequence 
comparison programs at the www.ncbi.nlm.nih.gov/BLAST website. A basic 
BLASTN search identified a sequence with 31% identity to the Anthocyanin 2 (An2) 
mRNA of Petunia integrifolia (GI 7673087 and 7673085). The presence of an open 
reading frame (i.e., the ANT1 cDNA) was predicted using the BLASTX program. 
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[00157] RT-PCR analysis confirmed that the gene whose nucleotide sequence is presented 
as SEQ ID NO: 1 (ANT1) was specifically overexpressed in tissue from plants having the 
ANT1 phenotype. Specifically, RNA was extracted from combined tissues derived from 
the H000001624 clonal plant line, which exhibited the ANT1 phenotype, and from wild 
type plants. RT-PCR was performed using primers specific to the sequence presented 
as SEQ ID NO: 1 and a constitutively expressed actin gene (positive control). The 
results showed that plants displaying the ANT1 phenotype over-expressed the mRNA 
for the ANT1 gene, indicating the enhanced expression of the ANT1 gene correlated 
with thcANTl phenotype. 

[00158] The amino acid sequence predicted from the ANT 1 nucleic acid sequence was 
determined using Vector NTI (InforMax, North Bethesda, MD) and is presented in 
SEQ ID NO:2. A Basic BLASTP 2.0. 1 1 search using the ncbi.nlm.nih.gov/BLAST 
website and the predicted ANT1 amino acid sequence was conducted. Results 
indicated that the predicted ANT1 protein sequence has 49% identitiy to the Petunia 
integrifolia An2 protein sequence (GI 7673088 and 7673086) and 65%~85% identity to 
several Myfc-related transcription factors in the N-terminal region, from approximately aa 
1-1 20 of SEQ ID NO.2. These Afyfr-related proteins included An2 from Petunia x hybrida 
(OI 7673084), the Zea mays CM (GI 22214), the Zea mays PL transcription factor (GI 
2343273) and an Arabidopsis transcription factor (GI 3941508). The Petunia An2 
gene is a regulator of the Anthocyanin biosynthetic pathway (Quattrocchio et al, 
1999). 

[00159] These results suggest that ANT1 is associated with modified leaf, flower or fruit 
color in Micro-tomato. 

EXAMPLE 3 

Confirmation of Phenotvpe/Genotvoe Association in Micro-tomato 
[00160] In order to further confirm the association between the ANT1 phenotype and the 
ANT1 gene presented in SEQ ID NO: 1, a genomic fragment comprising the ANT1 
gene, provided in SEQ ID NO:4 f was over-expressed in wild type Micro-Tom plants. 
Specifically, this 1012 bp genomic fragment, including theANTl coding regions, was 
cloned into the multiple cloning site (MCS) of the binary vector pAG2370. pAG2370, 
whose sequence is provided in SEQ ID NO:5 f comprises the vector backbone from the 
binaiy vector pBINlS (GI! 256363), T-DNA left and right border fragments, and, 
between border fragments, the CsVMV promoter sequence and a Nos termination 
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sequence for controlling expression of the inserted gene, and the neomycin 
phosphotransferase (NPTO) gene, which confers kanamycin resistance, whose 
expression is controlled by the RE4 promoter (US Patent No. 6054635) and the 07 
termination sequence. The ANTl fragment was cloned into Smal/Spel sites of 
pAG2370, inserted between the CsVMV promoter region, proximal to the 5' end of 
genomic fragment, and the Nos termination sequence, proximal to the 3' end of the 
genomic fragment. The pAG237(MA77 construct was transformed into 
Agrobacterium tumefaciens by electroporation. 
[00161] The pAG2370-AN77 construct described above was introduced into wild-type 
Micro-Tom plants via Agrobacterium-medmted transformation, essentially as 
described in Example 1. Briefly, explants were dissected from Micro-Tom seedlings. 
Explants were inoculated by soaking in the Agrobacterium suspension for 15 to 120 
minutes, blotted on sterile filter paper to remove excess bacteria, and plated. Explants 
were co-cultivated in non-selective media for 2-4 days at 24°C with a 16-hour 
photoperiod, after which they were transferred to selective media (with kanamycin) 
and returned to the growth room. Explants were transferred to fresh medium every 
two weeks until shoots were 0.5 to 1 cm tall. Shoots were excised from the explants, 
placed oh selective medium with kanamycin in Phytatrays (Sigma), and returned to the 
growth room for two to four weeks. Shoots were observed for rooting, and rooted 
shoots were out-planted to soil and acclimated to the greenhouse. The transformation 
process generated 64 independent T 0 events. Morphological observations demonstrated 
that 45 transgenic plants displayed the ANTl purple color phenotype and were either 
partially or entirely purple. Tissue was collected from six T| plants showing the ANTl 
phenotype, and RT-PCR was carried out using wild type as a control. While no ANTl 
gene expression could be detected in the wild-type control, five out of the six plants 
displaying the ANTl phenotype over-expressed the ANTl transcript. The internal 
control experiments, using a constitutively expressed actin gene, showed that all 
samples had similar levels of the actin expression. 

EXAMPLE 4 

Confirmation of Phenotype/Genotype Association in Arabidoosis 
[00163] In order to further confirm the association between the ANTl phenotype and the 

ANTl gene iz plants other than Micro Tom, the ANTl gene was introduced into and 
over-expressed in wild type Arabidopsis thaliana. 
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[00164] The pAG2370~AA77 construct described above was introduced into wild-type 

Arabidopsis plants via Agrobacterium-mzdxzX&d transformation using standard vacuum 
infiltration methods. All infiltrated seeds were plated in selective media containing 
tcanamycin, and kanamycin-resistant T, plants were transplanted to 72-cell flats. The 
transformation process generated 10 independent To events, of which seven displayed 
the ANT1 purple coloration phenotype in at least part of the plant Tissue was 
collected from four T| plants showing the ANT1 phenotype, and RT-PCR was carried 
out using wild type as a control. While no ANT1 gene expression could be detected in 
the wild-type control, all plants displaying the ANT1 phenotype over-expressed the 
ANT1 transcript The internal control experiments, using a constitutively expressed 
actin gene, showed that all samples had similar levels of the actin expression. 



EXAMPLES 

Confirmation of Phenotvne/Genotvpe Association in Tobacco 
[00166] In order to further confirm the association between the ANT1 phenotype and the 
ANT1 gene in plants other than Micro-Tom, the ANT I gene was introduced into and 
over-expressed in wild type Nicotiana tabacum (tobacco, Wisconsin-38 type). 
100167] The pAG2370-AAT/ construct described above was introduced into wild-type 
tobacco plants via Agrobacterium-tnediated transformation using essentially the 
following methods. In order to generate tobacco plants for transformation, tobacco 
seeds were germinated as follows: seeds were shaken about ten minutes on a lab 
shaker, in a solution containing approximately 1.3% to 2.1% sodium hypochlorite and 
one drop of Tween-20 (Polyoxyethylenesorbitan monolaurate) per 100 milliliters. 
Seeds were then washed in sterile water and sterilely transferred to the surface of 
TbSG medium (4.3 g/1 Murashige and Skoog salts, Phytotech; 1 ml/1 MS vitamins, 
Sigma; 30 g/1 sucrose; 8 g/1 agar, Sigma; pH adjusted to -5.8) in petri dishes or 
Phytatrays (Sigma), 10-50 seeds per vessel, and incubated in light at 25°C. Tobacco 
plants were dissected on sterile filter paper moistened with sterile, deionized water or 
liquid TbCo medium (4.3 g/1 Murashige and Skoog salts, Phytotech; 1 ml/1 MS 
vitamins, Sigma; 30 g/1 sucrose; 200 mg/1 KH 2 P0 4 ; 2 mg/1 Indole-3-acetic acid; 0.25 
mg/1 Kinetin; 0 to 100/rM Acetosyringone; 7 g/1 Agar, Sigma; pH adjusted to 5.4-5.6). 
Explants with cut edges on all sides could be generated by cutting the leaf from the 
plant, dissecting out and discarding Ehe snidvefo, and cutting the !eaf into 3 to 5 
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mm squares. Alternatively, discs could be cut from the lamina using a sterilized cork 
bora*. 

[00168] Explants were inoculated by soaking for 15-120 minutes in Agrobacterium 
suspension (OD600 between 0.175 and 0.225) prepared with the pAG2370-/UV77 
construct, then blotted and plated on TbCo medium. Explants were co-cultivated 2-4 
days at 24°C with a 16-hour photoperiod, and then transferred to Tb selective medium 
(4.3 gfl Murashige and Skoog salts; 1 ml/1 Nitsch and Nitsch vitamins, Duchefa; 30 g/1 
sucrose; 0.5 to 2 mg/1 6- Benzylaminopurine; 0 to 1 mg/1 Naphthylacetic Acid; 0 to 
750 mg/1 Carbenicillin; 0 to 300 mg/1 Timentin; 0 to 500 mg/1 Kanamycin; 7 to 8 g/1 
Agar, Sigma; pH adjusted to -5.8) containing kanamycin and re-transferred every two 
weeks until shoots were 0.5 to 1 cm tall. Shoots were excised from the explants, 
placed on TbR medium (4.3 g/1 Murashige and Skoog salts; 1 ml/1 Nitsch and Nitsch 
vitamins, Duchefa; 30 g/1 sucrose; 0 to 1 mg/1 Indole-3-butyric acid; 0 to 1 mg/1 
Naphthylacetic Acid; 0 to 100 mg/1 Carbenicillin; 0 to 200 mg/1 Timentin; 0 to 100 
mg/1 Kanamycin; 7 to 8 g/1 Agar, Sigma; pH adjusted to -5.8.) with kanamycin in 
Phytatrays, and grown two to four weeks, after which time the rooted shoots were 
planted to soil. 

[00169] The transformation process generated 89 independent To events, of which 54 

displayed the ANT1 purple coloration phenotype in at least part of the plant. Tissue 
was collected from five T| plants showing the ANT1 phenotype, and RT-PCR was 
carried out using wild type as a control. While no ANT1 gene expression could be 
detected in the wild-type control, all plants displaying the ANT1 phenotype over- 
expressed the ANT1 transcript. The internal control experiments, using a constituti vely 
expressed actin gene, showed that all samples had similar levels of the actin 
expression. 

EXAMPLE 6 

Use of the ANT1 gene as a transformation marker in tomato and tobacco 
[00171] Having successfully recapitulated the ANT1 phenotype in tomato and tobacco, as 
described above, we tested the utility of the ANT1 gene for utility as a transformation 
marker, based on its characteristic purple color, in these species. We transformed 
tobacco and Micro-Tom explants with the pAG2370-/UV77 vector, using methods 
described in the above Examples, grew the explants in the presence and absence of 
antibiotic (kanamycin), and compared transformation frequency based on rooting in 
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the presence of antibiotic in the media to transformation frequency based on purple 
color Results are shown in the Table below. 

[00172] Table 1: 



Transformation frequency of tobacco and tomato, based on antibiotic selection or color 



Species 


Kanamycin 
in media 


# 

explants 


Transformation 
frequency based on 
rooting in presence of 
antibiotic 


Transformation 
frequency based on 
purple color 


Tobacco 
Wisconsin 


+ 


82 


126%* 


80% 




60 




45% 


Tomato 
Micro- 
Tom 


+ 


103 


77% 


54% 




52 




6% 



♦This number reflects multiple transgenic events per original explain. When callus 
initiation occurs at two or three distinct points on the original explant, each is dissected 
and tested for shoot regeneration. 



[00173] The results indicated that the ANT1 gene could be successfully used for screening 
of positive transformants in cultures of tomato and tobacco, and may be useful in other 
plants as well. 
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IT IS CLAIMED: 

1 . An isolated polynucleotide comprising a nucleic acid sequence which encodes 
or is complementary to a sequence which encodes an ANT1 polypeptide having at least 
70% sequence identity to the amino acid sequence presented as SEQ ID NO:2. 

2. The polynucleotide of Claim 1 comprising a nucleic acid sequence that 
hybridizes under high stringency conditions to the nucleic acid sequence presented as SEQ 
ID NO: 1 , or the complement or a fragment thereof. 

3. The polynucleotide of Claim 1 wherein the ANT1 polypeptide has at least 80% 
sequence identity to the amino acid sequence presented as SEQ ID NO:2. 

4. The polynucleotide of Claim 1 wherein the AAtfV polypeptide has at least 90% 
sequence identity to the amino acid sequence presented as SEQ ID NO:2. 

5* The polynucleotide of Claim 1 wherein the ANT1 polypeptide has the amino 
acid sequence presented as SEQ ID NO:2. 

6. The polynucleotide of Claim 1 comprising the nucleic acid sequence presented 
as SEQ ID NO:l, or the complement thereof. 

7. A plant transformation vector comprising an isolated polynucleotide of Claim 1. 

8. A transgenic plant cell comprising the vector of Claim 7. 

9. A method of producing an ANT1 phenotype in a plant, said method comprising 
introducing into progenitor cells of the plant a plant transformation vector according to 
claim 7 and growing the transformed progenitor cells to produce a transgenic plant, 
wherein said polynucleotide sequence is expressed and said transgenic plant exhibits an 
ANT1 phenotype. 

10. A plant obtained by a method of Claim 9. 

1 1 . A plant part obtained from a plant according to Claim 10. 
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12. A method of selecting a transformed plant comprising a first polynucleotide 
comprising the steps of: 

(a) introducing into progenitor cells of the plant a plant transformation vector comprising 
the first polynucleotide and an ANT1 polynucleotide according to Claim 1, and 

(b) growing the progenitor cells to produce a plant that displays the ANT1 phenotype, 
wherein the plant that displays the ANT1 phenotype is selected as a transformed plant that 
also comprises the first polynucleotide. 
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SEQUENCE LISTING 

<110> Exelixis Plant Sciences, Inc. 

<120> IDENTIFICATION AND CHARACTERIZATION OF AN ANTHOCYANIN MUTANT 

(ANT1) IN TOMATO 

<130> EP01-002C-PC 

<150> US 60/244*685 

<151> 2000-10-30 

<160> 5 

<170> Patentln version 3.1 

<210> 1 

<211> 825 

<212> DMA 

<213> Lycopersicon esculentum 

<400> 1 



atgaacagta 


catctatgtc 


ttcattggga 


gtgagaaaag 


gttcatggac tgatgaagaa 


60 


gattttcttc 


taagaaaatg 


tattgataag 


tatggtgaag 


gaaaatggca tcttgttccc 


120 


ataagagctg 


gtctgaatag 


atgtcggaaa 


agttgtagat 


tgaggtggct gaattatcta 


180 


aggccacata 


tcaagagagg 


tgactttgaa 


caagatgaag 


tggatctcat tttgaggctt 


240 


cataagctct 


taggcaacag 


atggtcactt 


attgctggta 


gacttcccgg aaggacagct 


300 


aacgatgtga 


aaaactattg 


gaacactaat 


cttctaagga 


agttaaatac tactaaaatt 


360 


gttcctcgcg 


aaaagattaa 


caataagtgt 


ggagaaatta 


gtactaagat tgaaattata 


420 


aaacctcaac 


gacgcaagta 


tttctcaagc 


acaatgaaga 


atgttacaaa caataatgta 


480 


attttggacg 


aggaggaaca 


ttgcaaggaa 


ataataagtg 


agaaacaaac tccagatgca 


540 


tcgatggaca 


acgtagatcc 


atggtggata 


aatttactgg 


aaaattgcaa tgacgatatt 


600 


gaagaagatg 


aagaggttgt 


aattaattat 


gaaaaaacac 


taacaagttt gttacatgaa 


660 


gaaatatcac 


caccattaaa 


tattggtgaa 


ggtaactcca 


tgcaacaagg acaaataagt 


720 


catgaaaatt 


ggggtgaatt 


ttctcttaat 


ttaccaccca 


tgcaacaagg agtacaaaat 


780 


gatgattttt 


ctgctgaaat 


tgacttatgg 


aatctacttg 


attaa 


825 



<210> 2 
<211> 274 
<212> PRT 

<213> Lycopersicon esculentum 
<400> 2 

Met Asn Ser Thr Ser Met Ser Ser Leu Gly Val Arg Lys Gly Ser Trp 
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Thr Asp Glu Glu Asp Phe Leu Leu Arg Lys Cya lie Asp Lya Tyr Gly 
20 25 30 



Glu Gly Lys Trp His Leu Val Pro lie Arg Ala Gly Leu Asn Arg Cys 
35 " 40 45 



Arg Lys Ser Cys Arg Leu Arg Trp Leu Asn Tyr Leu Arg Pro His lie 
50 55 60 



Lys Arg Gly Asp Phe Glu Gin Asp Glu Val Asp Leu He Leu Arg Leu 
65 70 75 00 



His Lys Leu Leu Gly Asn Arg Trp Ser Leu He Ala Gly Arg Leu Pro 
85 90 95 



Gly Arg Thr Ala Asn Asp Val Lys Asn Tyr Trp Asn Thr Asn Leu Leu 
100 105 110 



Arg Lys Leu Asn Thr Thr Lys He Val Pro Arg Glu Lys He Asn Asn 
115 120 125 



Lys Cys Gly Glu He Ser Thr Lys He Glu He He Lys Pro Gin Arg 
130 135 140 



Arg Lys Tyr Phe Ser Ser Thr Met Lys Asn Val Thr Asn Asn Asn Val 
145 150 155 160 



He Leu Asp Glu Glu Glu His Cys Lys Glu He He Ser Glu Lys Gin 
165 170 175 



Thr Pro Asp Ala Ser Met Asp Asn Val Asp Pro Trp Trp He Asn Leu 
180 185 190 



Leu Glu Asn Cys Asn Asp Asp He Glu Glu Asp Glu Glu Val Val He 
195 200 205 



Asn Tyr Glu Lys Thr Leu Thr Ser Leu Leu His Glu Glu He Ser Pro 
210 215 220 



Pro Leu Asn He Gly Glu Gly Asn Ser Met Gin Gin Gly Gin He Ser 
225 230 235 240 



His Glu Asn Trp Gly Glu Phe Ser Leu Asn Leu Pro Pro Met Gin Gin 
245 250 255 



Gly Val Gin Asn Asp Asp Phe Ser Ala Glu lie Asp Leu Trp Asn Leu 
260 265 270 
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<210> 3 
<211> X0078 
<212> DNA 
<213> pAG3202 

<400> 3 

tccctttagt gagggttaat tccgagcttg gcgtaatcat ggtcatagct gtttcctgtg 60 

tgaaattgtt atccgctcac aattccacac aacatacgag ccggaagcat aaagtgtaaa 120 

gcctggggtg cctaatgagt gagctaactc acattaattg cgttgcgctc actgcccgct 180 

ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg cgcggggaga 240 

ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct gcgctcggtc 300 

gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt atccacagaa 360 

tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt 420 

aaaaaggccg cgttgctggc gtttttccat aggctccgcc cccctgacga gcatcacaaa 480 

aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata ccaggcgttt 540 

ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac cggatacctg 600 

tccgcctttc tcccttcggg aagcgtggcg ctttctcata gctcacgctg taggtatctc 660 

agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc 720 

gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag acacgactta 780 

tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt aggcggtgct 840 

acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt atttggtatc 900 

tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg atccggcaaa 960 

caaaccaccg etggtagcgg tggttttttt gtttgcaagc agcagattac gcgcagaaaa 1020 

aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca gtggaacgaa 1080 

aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac ctagatcctt 1140 

ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac ttggtctgac 1200 

agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt tcgttcatcc 1260 

atagttgcct gactccccgt cgtgtagata actacgatac gggagggctt accatctggc 1320 

cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt atcagcaata 1380 

aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc cgcctccatc 1440 

cagtctatta attgttgccg ggaagctaga gtaagtagfcfc egeeagttaa fcagtttgcgc 1500 
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ycycctLtcc 


y*f -i ****** t' **£%*'*** 


rrrrrara f" nnt* t~ n 

gggc egg c eg 


6420 


cccccgccgc 


ft* ********* ft* ********* 
e gggc egg c g 


******** ft" ft" #*>ft* A ft* 

yCCyLLCtat 


**** ****** ft* **** n n 

yyece tycaa 


a f+t*t**f*7f* ** Aran 


a a rararara rafr* ran 






raraAra a/* ai*rn 


cggccgccgg 


cy u i.y ivy y ci w 


ciww winy cyya 


a AAraftr »*nrarara 


6540 


**/ A **• ft* 4*#A ^** A 

CLCaCCyaCa 


>*#a ft* *k ********** 

gacgaggggc 


a**#va ^*«frfr* ft- rra ** 

ggacgt tyac 


q ^-ft» ft* f* ******** 

ac c cgagggg 


** **na e*^ ** a ****** 


>**rararanrarararan 

cggcgcggcg 


6600 


ft* ft* rrn/i n #v a ft* /V 

Ctydcag acy 


ayyygCay gc 


ft- #*rta ft~ ft* ft* ** ****■ 
LLyaLLLtyy 


************ A/^#»ft" 

w w yyuyauy u 


ftnCk n**f" tin** f* 
yy<*yv»i*yyv-»w 


Aff^rat*rara<**AA 
ay ww wcy waa 


6660 


A ft" **** ******i A A A 


a ******** ft* na ft~ 

acg cc tyui-L 


ft- ♦* A f******* A fTt" 


ft* ft* ****** z\**£* rm 
v wwwwdwctyct 


Lyauy uyyav. 


n a rara ra *r rararara 
acty ww wy yy y 


6720 


A A A ft ft" ****** ** 

awaoywyCvw 


t^rarannl*' A t" f*n 




CTf*f* Ctf* flA /** ft" A 

yyv»yv>yav , -o 


**• ♦* raA a ra a ft*ra 


A rararrrararaoraA 
ayyyy wy v»y a 


6780 


ft* r* r~- h t" rr a f ap 
www w Lyu^oUi 


vwy f-yyyyLa 




an a t cra.cfcfcrcf 

^jj** **o^rJlf rj rJ 


cocacctatt 


aacatthaaa 

^uua w w vyny 


6840 


gggctgtcca 


caggcagaaa 


atccagcatt 


tgcaagggtt 


tccgcccgtt 


tttcggccac 


6900 


cgctaacctg 


tcttttaacc 


tgcttttaaa 


ccaatattta 


taaaccttgt 


ttttaaccag 


6960 


ggctgcgccc 


tgtgcgcgtg 


accgcgcacg 


ccgaaggggg 


gtgccccccc 


ttctcgaacc 


7020 


ctcccggccc 


getaacgegg 


gcctcccatc 


cccccagggg 


ctgcgcccct 


cggccgcgaa 


7080 
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cggcctcacc 


ccaaaaatgg 


cagcgctggc 


agtccttgsc 


wwwgcykgga 


tcggggcmst 


7140 


nnsggratgg 


gcgatcagcc 


cgagcgcgac 


gcccggaagc 


attgacgtgc 


cgcaggtgct 


7200 


ggcatcraca 


ttcagcgacc 


aggtgccggg 


cagtgagggc 


99 c 90 cc tgg 


gtggcggcct 


7260 


gcccttcact 


tcggccgtcg 


gggcattcac 


ggacttcatg 


gcggggccgg 


caatttttac 


7320 


cttgggcatt 


cttggcatak 


tggtcgcggg 


tgccgtgctc 


gtgttcgggg 


gtgaattgca 


7380 


agctagcttg 


cttggtcgtt 


ccggtaccgt 


gaacgtcggc 


tcgattgtac 


ctgcgttcaa 


7440 


atactttgcg 


atcgtgttgc 


gcgcctgccc 


ggtgcgtcgg 


ctgatctcac 


ggatcgactg 


7500 


cttctctcgc 


aacgccatcc 


gacggatgat 


gtttaaaagt 


cccatgtgga 


tcactccgtt 


7560 


gccccgtcgc 


tcaccgtgtt 


99099?aagg 


tgcacatggc 


tcagttctca 


atggaaatta 


7620 


tctgcctaac 


cggctcagtt 


ctgcgtagaa 


accaacatgc 


aagctccacc 


gggtgcaaag 


7680 


cggcagcggc 


ggcaggatat 


attcaattgt 


aaatggcttc 


atgtccggga 


aatctacatg 


7740 


gatcagcaat 


gagtatgatg 


gtcaatatgg 


aggaaaagaa 


agagtaatta 


ccaatttttt 


7800 


ttcaattcaa 


aaatgtagat 


gtccgcagcg 


ttattataaa 


atgaaagtac 


attttgataa 


7860 


aacgacaaat 


tacgatccgt 


cgtatttata 


ggcgaaagca 


ataaacaaat 


tattctaatt 


7920 


cggaaatctt 


tatttcgacg 


tgtctacatt 


cacgtccaaa 


tgggggctta 


gatgagaaac 


7980 


ttcacgatcg 


atatctagat 


ctcgagctcg 


agatctagat 


atcgataagc 


ttgcatgcct 


8040 


gcagcggaat 


tcccatcttg 


aaagaaatat 


agtttaaata 


tttattgata 


aaataacaag 


8100 


tcaggtatta 


tagtccaagc 


aaaaacataa 


atttattgat 


gcaagtttaa 


attcagaaat 


8160 


atttcaataa 


ct gat tat at 


cagctggtac 


attgccgtag 


atgaaagact 


gagtgcgata 


8220 


ttatgtgtaa 


tacataaatt 


gatgatatag 


ctagcttagc 


tcatcggggg 


atccgtcgaa 


8280 


gctagcttgg 


gtcccgctca 


gaagaactcg 


tcaagaaggc 


gatagaaggc 


gatgcgctgc 


8340 


gaatcgggag 


cggcgatacc 


gtaaagcacg 


aggaagcggt 


cagcccattc 


gccgccaagc 


8400 


tcttcagcaa 


tatcacgggt 


agccaacgct 


atgtcctgat 


agcggtccgc 


cacacccagc 


8460 


cggccacagt 


cgatgaatcc 


agaaaagcgg 


ccattttcca 


ccatgatatt 


cggcaagcag 


8520 


gcatcgccat 


gggtcacgac 


gagatcctcg 


ccgtcgggca 


tgcgcgcctt 


gagcctggcg 


8580 


aacagttcgg 


ctggcgcgag 


cccctgatgc 


tcttcgtcca 


gatcatcctg 


atcgacaaga 


8640 


ccggcttcca 


tccgagtacg 


tgctcgctcg 


atgcgatgtt 


tcgcttggtg 


gtcgaatggg 


8700 


caggtagccg 


gatcaagcgt 


atgcagccgc 


cgcattgcat 


cagccatgat 


ggatactttc 


o/ou 


tcggcaggag 


caaggtgaga 


tgacaggaga 


tcctgccccg 


gcacttcgcc 


caatagcagc 


8820 


cagtcccttc 


ccgcttcagt 


gacaacgtcg 


agcacagctg 


cgcaaggaac 


gcccgtcgtg 


8880 


geeageeacg 


atagecgegc 


tgcctcgtcc 


tgcagttcat 


tcagggcacc 


ggacaggtcg 


8940 
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gcgcccctgc 


nr* t" fin c* a err* t* 


ggaacacggc 


y y wc» way ay 


9000 




t»^C4y wi.ci way 


c oer a a t a err* f 


LliVCIliVvB 


oywyyvwyyo 


9060 


gaaccLycgt gcaacccauc 




ViCMiy t» LCCvd 


*-y S sf »» *> *" 


to^» *rf t* t* rr^ r* 


9120 


Evcggcgaca gagggaggac 


f-> nrtf' to t~ ♦* t* to a 


to f*MJ|MJ| BJ|fa f* 


CQaLtgddtC 


to A f*f+f*f1f+f 3* A 


9180 


agctactcag atacagacca 


Art#" AM A* toMrt/A 

dCLdCCCCgd 


aCydCdtyCC 


/■to to/* to AJk A to/w 

gccgcaaaug 


Udaduao CCU 


Q240 


agcctcgacc ccccgagcca 


afnf *»/ta to to _ 


#■ fa m #«fa> to nwf M 

gxagc uagig 


A rt •* to « to to a m to 

oC u uyt tact 


ytaooCyt ta 


7JVU 


acagcgaagc ggacgcgccg 


yagxggcuau 


cgggc a t gac 


tayaoatCot 


^ ■ I tmmmgmjm to m to 

g eggg c u c eg 




cgcgcaggcc cgaaccctgc 


cgtccacgcc 


a. fa. fa. fa, 4. _ _ fa.rrrr 


/rrrf- to r> to /«• <-» to to 

gyttatcctt 


to to to a to tort a/ysi 

tttaccgaga 




to to m a a am a aft'f^t'rtrttt^ 

ctcgaaagca atccccggat 


fa> to fa to S 

tcggttygta 


at.gcct.cgag 


rtrt rt rt ^ rt A A 

yCCCCCaaya 


oataaydyvy 




aagaatctga caatatgtag 


tototoAMAtoAtoj** 

tctagacatg 


taccctttga 


gtccataagt 


tototo^AAMtototo 

cc cgaagc c c 




gcatttccca tggtatgtaa 


taagttgatg 


cacaattttg 


^.fato/f^tototototo 

gc tyCLLtC V. 


^ to to to to AAA 

aLtCtayAad 




aaktacagga accgacaaaa 


ttgattataa 


acgtgagaaa 


ct cacg cgx t 


to A A A to A to ^vto CM 

bfiaaCauyta 


7QOU 


gaccgxcttc cgaccataca 


caacataaat. 


to « n a to to 

ccagaatttg 


& A to ^#to A to to to 

adtytaCtvv 


vBi^f- to a ant' 




aagaaaaacc caggcagccc 


ccgcacacca 


toAMMtortAAAto 

caccccaaac 


to to mm to to a to a a 

ccggucauaa 


a to to /^/v to A A to to 

dttcycaatt 


Q7RO 


ttgtataott caaaagtagt 


taatggaaac 


ataatagtat 


caaccagcac 


caaac t caac 


704U 


cataagttaa tgattacaac 


aaagcaattg 


tagcatcatt 


gtggtcgtgc 


cggagtggtt 


9900 


atcgggcatg actagaaatc 


atgtgggctt 


tgcccgcgca 


ggttcgaatc 


ctgccgacca 


9960 


cgttttttta tgacatcatt 


tttaatcttt 


ttctcaaatg 


gaccaccatt 


aagectaate 


10020 


ctgctgggtc tgggctaatc 


atctcaatta 


agcttcccgg 


gaaggcacca 


SY— 4- to to tort to 


1UU / o 


<210> 4 
<211> 1012 
<212> DNA 

<213> Lycopersicon esculentum 










<400> 4 

atgaacagta catctatgtc 


ttcattggga 


gtgagaaaag 


gttcatggac 


tgatgaagaa 


ou 


gattttcttc taagaaaatg tattgataag 


to r\ to Mrt ^MOnfi 

cacggcgaag 


yduadtyyCa 


to rt to tortto to/*/*/* 

tcttyttccc 


100 


ataagagctg gtaactatta aattaactat 


#*nf*/v(" to r» to to to 


toton to to toffto f^to 

ttatttytCL 


tof-/-»to/Yt"rttort^» 


1R0 


ttttatttga cgttattacg aatatcatct 


y act etc* v. y wov 


gtyLoyytut 


y»a uayo uy l> 


2A0 


cggaaaagtt gtagattgag gtggctgaat 


tatcuaagyc 


^araf*a(raa 
CaCatatwaa 


ft Art a rttt^ftste* 
y ay ay y tyaw 


JUv 


tttgaacaag atgaagtgga 


tctcattttg 


aggcttcata 


agctcttagg 


caacaggcat 


360 


gcaagtttat gttttgacaa 


aatttgatta 


gtatatatta 


tatatacgtg 


tgactatttc 


420 


atcEaaatgt taegttattt 


fcaegtagutg 




^ ;C ^.^"y t a a'tiC 




ess 


gacagctaac gatgtgaaaa 


actattggaa 


cactaatctt 


ctaaggaagt 


taaatactac 


540 
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taaaattgtt 


cctcgcgaaa 


agattaacaa 


taagtgtgga 


gaaattagta 


ctaagattga 


600 


aattataaaa 


cctcaacgac 


gcaagtattt 


ctcaagcaca 


atgaagaatg 


ttacaaacaa 


660 


taatgtaatt 


ttggacgagg 


aggaacattg 


caaggaaata 


ataagtgaga 


aacaaactcc 


720 


agatgcatcg 


atggacaacg 


tagatccatg 


gtggataaat 


ttactggaaa 


attgcaatga 


780 


cgatattgaa 


gaagatgaag 


aggttgtaat 


taattatgaa 


aaaacactaa 


caagtttgtt 


840 


acatgaagaa 


atatcaccac 


cattaaatat 


tggtgaaggt 


aactccatgc 


aacaaggaca 


900 


aataagtcat 


(jaaaattggg 


gtgaattttc 


tcttaattta 


ccacccatgc 


aacaaggagt 


960 


acaaaatgat 


gatttttctg 


ctgaaattga 


cttatggaat 


ctacttgatt 


aa 


1012 



<210> 5 

<211> 12241 

<212> ENA 

<213> pAG2370 

<400> 5 



tgagcgtcgc 


aaaggcgctc 


ggtcttgcct 


tgctcgtcgg 


tgatgtactt 


caccagctcc 


60 


gcgaagtcgc 


tcttcttgat 


ggagcgcatg 


gggacgtgct 


tggcaatcac 


gcgcaccccc 


120 


cggccgtttt 


agcggctaaa 


aaagtcatgg 


ctctgccctc 


gggcggacca 


cgcccatcat 


180 


gaccttgcca 


agctcgtcct 


gcttctcttc 


gatcttcgcc 


agcagggcga 


ggatcgtggc 


240 


atcaccgaac 


cgcgccgtgc 


gcgggtcgtc 


ggtgagccag 


agtttcagca 


ggccgcccag 


300 


gcggcccagg 


tcgccattga 


tgegggccag 


ctcgcggacg 


tgctcatagt 


ccacgacgcc 


360 


cgtgattttg 


tagccctggc 


cgacggccag 


caggtaggcc 


gacaggctca 


tgccggccgc 


420 


cgccgccttt 


tcctcaatcg 


ctcttcgttc 


gtctggaagg 


cagtacacct 


tgataggtgg 


480 


gctgcccttc 


ctggttggct 


tggtttcatc 


agccatccgc 


ttgccctcat 


ctgttacgcc 


540 


ggcggtagcc 


ggccagcctc 


gcagagcagg 


attcccgttg 


agcaccgcca 


ggtgcgaata 


600 


agggacagtg 


aagaaggaac 


acccgctcgc 


gggtgggcct 


acttcaccta 


tcctgcccgg 


660 


ctgacgccgt 


tggatacacc 


aaggaaagtc 


tacacgaacc 


ctttggcaaa 


atcctgtata 


720 


tcgtgcgaaa 


aaggatggat 


ataccgaaaa 


aatcgctata 


atgaccccga 


agcagggtta 


780 


tgcagcggaa 


aagcgccaeg 


cttcccgaag 


ggagaaaggc 


ggacaggtat 


ccggtaagcg 


840 


gcagggtcgg 


aacaggagag 


cgcacgaggg 


agcttccagg 


gggaaacgcc 


tggtatcttt 


900 


atagtcctgt 


cgggtttcgc 


cacctctgac 


ttgagcgtcg 


atttttgtga 


tgctcgtcag 


960 


gggggcggag 


cctatggaaa 


aacgccagca 


acgcggcctt 


tttacggttc 


ctggcctttt 


1020 


gctggccttt 


tgctcacatg 


ttctttcctg 


cgttatcccc 


tgattctgtg 


gataaccgta 


1080 


ttaccgcctc 


tgagtgagct 


gataccgctc 


gccgcagccg 


aacgaccgag 


cgcagcgagt 


1140 
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Ca g t gage ga 


ggaagcggaa 


gagcgccaga 


aggccgccag 


agaggc eg ag 


cgcggccgcg 






gc c agggcag 




ayccL g tayt 


gggc tgc tac 


gggcgcccga 




cgcggcggaa 




yatgt cgcct 


auaLyytULL 


gc tgxoytga 


gxgggxcgcg 


1*120 


c c ccggcagc 


ggccc cgacc 




o t" ♦* 1" M t 1 * rnnh 


^ ?s ^ fc» « r» a#v#«^ 

ccttcdacyL 


fta-f»J*ift* j**\ji gm #m A 

tcccgacaac 




ydyCCLCCLL 


cucyccuacc 


uaLUyaLaaL 


LaLLyuyayL 


ccc tgc ttyd 


acgccgcgcc 




cggaccggcc 


ccgccgaagg 


cgcccdccgc 


ggccc gcaac 


ageggegaga 


n/iriJTiM.jijijit ji 

gcggagcccg 


I3UU 


ttcaacggtg 


ccgccgcgcc 


cgccggcatc 


gc t g t c gc eg 


JBJI .ffm 4* MM ft» JA JB ft* 

gcccgccccc 


caagcacggc 


X30U 


cccaacagtg 


aagtagccga 


ccgccaccag 


cgcattgacg 


gcgtccccgg 


ccgaaaaacc 




cgcctcgcag 


aggaagcgaa 


gccgcgcgcc 


ggccgxcccc 


ate t gcggtg 


cgcccggccg 


IDOU 


cgtgccggca 


z ggac gcgcg 


K*n & ft"* ^ j**y*i» 

cgccaccgcg 


gx aggegage 


agegee tgee 


tgaagctgcg 




ggcattcccg 


a t c agaaa t g 


agcgc cag t c 


gtegtegget 


ctcggcaccg 


aatgcgtatg 


loUU 


attctccgcc 


agcatggctt 


cggccagtgc 


gtcgagcagc 


gcccgcttgt 


tcctgaagtg 


looU 


ccagtaaagc 


gccggctgct 


gaacccccaa 


ccgttccgcc 


agtttgcgtg 


tcgtcagacc 


1 no A 
1920 


gtctacgccg 


acctcgttca 


acaggtccag 


ggeggcaegg 


atcactgtat 


teggctgeaa 


i ABA 


ctttgccacg 


_**. a» a* *^«*». a *s «m ft* ft*- 

cttgacactc 


*m a fc» A *_, ft* jm ok ft* 

caccaccgac 


aaacataata 


tgtccaccaa 


cttatcagtg 


2 0«0 


ataaagaatc 


mm mm jl ■ ■ cm cm ftji ft* «** £k 

cgcgcgc cca 


accggaccag 


eggaggctgg 


tec ggaggee 


agacgtgaaa 




cccaacatac 


ccctgatcgt 


aattctgagc 


actgtcgcgc 


ccgacgc c gc 


cggcatcggc 




ctgaccatgc 


cggtgctgcc 


gggcctcctg 


cgcgatctgg 


ttcactcgaa 


cgacgtcacc 




gcccactatg 


gca tret get 


ggcgctgtat 


gcgttggtgc 


aacccgcctg 


cgcacctgtg 


22o0 


ctgggcgcgc 


tgtcggatcg 


tttcgggcgg 


cggccaatct 


tgctcgtctc 


gctggccggc 


23^0 


gccagatctg 


gggaaccctg 


tggttggcat 


gcacatacaa 


atggacgaac 


gga caaacc t 


Oil nn 
2400 


tttcacgccc 


ccccaaacat 


ccgactaccc 


taataaaege 


CCCCttCCCt 


caggtttacc 


24oO 


cgccaatata 


4» ^» ^r#» « a a 

ccc cgccaaa 


c act. garage 


ttaaactgaa 


ggegggaaac 


gacaatctga 


2920 


ft* ♦* |* A* A -0*1 >»^*i i**J 

ccacgagcgg 


£fc**#B flk ft* ft* B QA#f 

agaa c uaagg 


— W n j« ft* ^ Q f% ft) ft» ft* 

yagccacgc c 


atgacccccg 


a ft* #n* n /t#t/i 

ccgacgacgc 


gggacaagee 




gttttacgtc 


ft* Jmt+M A ft- j*v A 

cggaaccgac 


agaaccgcaa 


eg l cgaagga 


gccacccagc 


J* ft* Wi ftjH #\ *V ft» 

cgacccgaac 


2O40 


4- AAAAt ft- « ft* 

tcccgaccca 


graacataga 


cgacaccgcg 


AMAMCi ft* m A ft* ft* 

cgega c aa c c 


taccctagcx 


tgegegctat 


2 /00 


actccgxtcc 


CCaCCyCgua 


ccaaacgcac 


flat tgcggga 


ft* ^ ft* f» a ft* ^ Q ft* 


aaaaac ccac 


2 /bU 


ctcataaata 


acgtcatgca 


ttacatgtta 


attattacat 


gettaaegta 


attcaacaga 


2820 


aactatatga 


taatcatcgc 


aagaccggca 


acaggattca 


atcttaagaa 


actttattgc 


2880 


caaatgtttg 


aacgatcggg 


gaaattcgcg 


ageteggtae 


ccgctctaga 


actagtggat 


2940 


cccccgggct 


gcaggaattc 


aaacttacaa 


atttctctga 


acttgtatcc 


tcagtacttc 


3000 
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aaagaaaata 


gcttacacca 


aattttttct 


tgtctccaca 


aatgccgaac 


ttggttcctt 


men 


atataggaaa 


actcaagggc 


aaaaatgaca 


cggaaaaa ta 


taaaaggata 


agtagtgggg 




gataagattc 


ctttgtgata 


aggttacttt 


ccgcccccac 


atttcccacc 


4.fe„__.fe_fe_fe 

ccacacgcgc 




cctctatgtc 


tctttcacaa 


tcaccgacct 


t._*.-,4-4-_4»f. / - a 

tatcctctcc 


uccccaccgc 


| _ 4 ■ 4. — -» —4. 

cgxcgtcagc 


1940. 


gcttacgtct 


tcaagattct 


tttcttcgcc 


tggctcttcc 


ctttcaacct 


m4- llJluii , ■ hi- r, 

ctacgtattc 


J3UU 


ttcttcgtat 


tctggcagta 


taggatcttg 


tatctgtaca 


ttctccattt 


ttgaacatag 


1 1£ A 
J JOO 


gttgcatatg 


tgccgcatat 


tgatctgctt 


cttgctgagc 


tcacataata 


cttccatagt 


1 AO A 

3440 


ttttcccgta 


aacattggat 


tcttgatgct 


acatcttgga 


taattacctt 


ctcgtaccaa 


3480 


gcttaattga 


gatgattagc 


ccagacccag 


caggattagg 


cctaatggcg 


gtccatttga 


334 O 


gaaaaagatt 


aaaaatgatg 


tcataaaaaa 


acgtggtcgg 


caggattcga 


acctgcgcgg 


1 cftn 
3©0O 


gcaaagccca 


catgatttct 


agtcatgccc 


gataaccact 


ccggcacgac 


cacaatgatg 


3oo0 


ctacaattgc 


cttgttgtaa 


tcattaactt 


atggttgagt 


ttgatgctga 


4_4___4___^_4_ 

ttaatactat 


inn 


tatgtttcca 


ttaactactt 


ttgaagtata 


caaaattacg 


aatttataac 


caaatttgag 


3780 


gtataatatg 


cgagagctac 


ctaaattttt 


cttacttaat 


tttaaagtac 


— A- — — A- — 

attcaaattc 


3840 


tgaatttata 


ttgtgtatag 


tcagaaaaca 


atctacatat 


ttaaacacat 


aaatttctca 


3900 


cgtttataat 


caattttgtc 


ggttcctgta 


atttttctaa 


aataaaaagc 


aaccaaaatt 


3900 


gtgcatcaac 


ttattacata 


ccatgggaaa 


tgcaaacttc 


aaaacttatg 


gactcaaagg 


V! AO A 

40z0 


gtacatatct 


aaactacata 


ttgtcagatt 


cttcactctt 


atttcttgag 


ggcctcgagg 


4080 


cattaccaac 


caaatccaaa 


aattgctttc 


gaatctcaat 


aaaaaggata 


accccatgaa 


4 1 4 A 

4140 


aaagacgtgg 


acggcaggat 


tcgaacctgc 


gcgcagagcc 


cacatgattt 


ctagtcatgc 


4»>nft 

4200 


ccgataacca 


ctccggcacg 


tccacttcac 


tgttaacgtt 


tacagtaaca 


agtcactaac 


4/ DO 


tactaatcaa 


cattagctca 


ggaaatcaaa 


actagattat 


ttacatttac 


aacgaca t g t 


4 J4O 


cgttcgaagt 


agttggtctg 


tatctgagta 


gctttggcgg 


gtagattcaa 


tcgcatttct 


4380 


gcatataaaa 


ctgatcctcc 


ctctatcgcc 


aaagtcaaac 


tgaaaagggc 


cgggggcaag 


4440 


atatgggagc 


ttggattgaa 


caagatggat 


tgcacgcagg 


ttctccggcc 


gcttgggtgg 


a enn 
4300 


agaggctatt 


cggctatgac 


tgggcacaac 


agacaatcgg 


ccgctctgat, 


gccgccgtgt 


4doO 


tccggctgtc 


agcgcagggg 


cgcccggttc 


*- 4- 4- 4- *- «w4- *m mm. a* 

tttctgtcaa 


gaccgacctg 


tccggtgccc 


il CO A 


tgaatgaact 


gcaggacgag 


gcagcgcggc 


tatcgtggct 


ggccacgacg 


ggcgttcctt 


4680 


gcgcagctgt 


gctcgacgtt 


gtcactgaag 


cgggaaggga 


ctggctgcta 


ttgggcgaag 


4740 


tgccggggca 


ggatctcctg 


tcatctcacc 


ttgctcctgc 


cgagaaagta 


tccatcatgg 


4800 


ctgatgcaat 


gcggcggctg 


catacgcttg 


atccggctac 


ctgcccattc 


gaccaccaag 


4860 
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cgaaacatcg catcgagcga 


gcacgtactc 


ggatggaagc 


cggtcttgtc 


gatcaggatg 


4920 


atctggacga agagcatcag 


gggc t cgcgc 


cagccgaact 


gttcgccagg 


ctcaaggcgc 


4980 


gcatgcccga cggcgaggat 


ctcgtcgtga 


cccatggcga 


tgcctgcttg 


ccgaatatca 


5040 


tggtggaaaa tggccgcttt 


tctggattca 


tcgactgtgg 


ccggctgggt 


gtggcggacc 


5100 


gctatcagga catagcgttg 


gctacccgtg 


atattgctga 


agagcttggc 


ggcgaatggg 


5160 


ctgaccgctt cctcgtgctt 


tacggtatcg 


ccgctcccga 


ttcgcagcgc 


atcgccttct 


5220 


atcgccttct tgacgagttc 


ttctgacgat 


gagctaagct 


agctatatca 


tcaatttatg 


5280 


tattacacat aatatcgcac 


tcagtctttc 


atctacggca 


atgtaccagc 


tgatataatc 


5340 


agttattgaa atatttctga 


atttaaactt 


gcatcaataa 


atttatgttt 


ttgcttggac 


5400 


tataatacct gacttgttat 


tttatcaata 


aatatttaaa 


ctatatttct 


ttcaagatgg 


5460 


gaattaattc actggccgtc 


gttttacaac 


gtcgtgactg 


ggaaaaccct 


ggcgttaccc 


5520 


aacttaatcg ccttgcagca 


catccccctt 


tcgccagctg 


gcgtaatagc 


gaagaggccc 


5580 


gcaccgatcg cccttcccaa 


cagttgcgca 


gcctgaatgg 


cgcccgctcc 


tttcgctttc 


5640 


ttcccttcct ttctcgccac 


gt tcgccggc 


tttccccgtc 


aagctctaaa 


tcgggggctc 


5700 


cctttagggt tccgatttag 


tgctttacgg 


cacctcgacc 


ccaaaaaact 


tgatttgggt 


5760 


.gatggttcac gtagtgggcc 


atcgccctga 


tagacggttt 


ttcgcccttt 


gacgttggag 


5820 


tccacgttct ttaatagtgg 


actcttgttc 


caaactggaa 


caacactcaa 


ccctatctcg 


5880 


ggctattctt ttgatttata 


agggattttg 


ccgatttcgg 


aaccaccatc 


aaacaggatt 


5940 


ttcgcctgct ggggcaaacc 


agcgtggacc 


gcttgctgca 


actctctcag 


ggccaggegg 


6000 


tgaagggcaa tcagctgttg 


cccgtctcac 


tggtgaaaag 


aaaaaccacc 


ccagtacatt 


6060 


aaaaacgtcc gcaatgtgtt 


attaagttgt 


ctaagcgtca 


atttgtttac 


accacaatat 


6120 


atcctgccac cagccagcca 


acagctcccc 


gaccggcagc 


tcggcacaaa 


atcaccactc 


6180 


gatacaggca gcccatcagt 


ccgggacggc 


gtcagcggga 


gagccgttgt 


aaggcggcag 


6240 


actttgctca tgttaccgat 


gctattcgga 


agaacggcaa 


ctaagctgcc 


gggtttgaaa 


6300 


cacggatgat ctcgcggagg 


gtagcatgtt 


gattgtaacg 


atgacagagc 


gttgctgcct 


6360 


gtgatcaaat atcatctccc 


tcgcagagat 


ccgaattatc 


agccttctta 


ttcatttctc 


6420 


gcttaaccgt gacaggctgt 


cgatcttgag 


aactatgccg 


acataatagg 


aaatcgctgg 


6480 


ataaagccgc tgaggaagct 


gagtggcgct 


atttctttag 


aagtgaacgt 


tgacgatatc 


6540 


aactccccta tccattgctc 


accgaatggt 


acaggtcggg 


gacccgaagt 


tccgactgtc 


6600 


tjgee^gatgc atccccggct 






ctgagaaagc 


ccagtaagga 


6660 


aacaactgta ggttcgagtc 


gcgagatccc 


ccggaaccaa 


aggaagtagg 


ttaaacccgc 


6720 
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tccgatcagg ccgagccacg ccaggccgag aacattggtt cctgtaggca tcgggattgg 6780 

cggatcaaac actaaagcta ctggaacgag cagaagtcct ccggccgcca gttgccaggc 6840 

ggtaaaggtg agcagaggca cgggaggttg ccacttgcgg gtcagcacgg ttccgaacgc 6900 

catggaaacc gcccccgcca ggcccgctgc gacgccgaca ggatctagcg ctgcgtttgg 6960 

tgtcaacacc aacagcgcca cgcccgcagt tccgcaaata gcccccagga ccgccatcaa 7020 

tcgtatcggg ctacctagca gagcggcaga gatgaacacg accatcagcg gctgcacagc 7080 

gcctaccgtc gccgcgaccc cgcccggcag gcggtagacc gaaataaaca acaagctcca 7140 

gaatagcgaa atattaagtg cgccgaggat gaagatgcgc atccaccaga ttcccgttgg 7200 

aatctgtcgg acgatcatca cgagcaataa acccgccggc aacgcccgca gcagcatacc 7260 

ggcgacccct cggcctcgct gttcgggctc cacgaaaacg ccggacagat gcgccttgtg 7320 

agcgtccttg gggccgtcct cctgtttgaa gaccgacagc ccaatgatct cgccgtcgat 7380 

gtaggcgccg aatgccacgg catctcgcaa ccgttcagcg aacgcctcca tgggcttttt 7440 

ctcctcgtgc tcgtaaacgg acccgaacat ctctggagct ttcttcaggg ccgacaatcg 7500 

gatctcgcgg aaatcctgca cgtcggccgc tccaagccgt cgaatctgag ccttaatcac 7560 

aattgtcaat tttaatcctc tgtttatcgg cagttcgtag agcgcgccgt gcgtcccgag 7620 

cgatactgag cgaagcaagt gcgtcgagca gtgcccgctt gttcctgaaa tgccagtaaa 7680 

gcgctggctg ctgaaccccc agccggaact gaccccacaa ggccctagcg tttgcaatgc 7740 

accaggtcat cattgaccca ggcgtgttcc accaggccgc tgcctcgcaa ctcttcgcag 7800 

gcttcgccga cctgctcgcg ccacttcttc acgcgggtgg aatccgatcc gcacatgagg 7860 

cggaaggttt ccagcttgag cgggtacggc tcccggtgcg agctgaaata gtcgaacatc 7920 

cgtcgggccg tcggcgacag cttgcggtac ttctcccata tgaatttcgt gtagtggtcg 7980 

ccagcaaaca gcacgacgat ttcctcgtcg atcaggacct ggcaacggga cgttttcttg 8040 

ccacggtcca ggacgcggaa gcggtgcagc agcgacaccg attccaggtg cccaacgcgg 8100 

tcggacgtga agcccatcgc cgtcgcctgt aggcgcgaca ggcattcctc ggccttcgtg 8160 

taataccggc cattgatcga ccagcccagg tcctggcaaa gctcgtagaa cgtgaaggtg 8220 

atcggctcgc cgataggggt gcgcttcgcg tactccaaca cctgctgcca caccagttcg 8280 

tcatcgtcgg cccgcagctc gacgccggtg taggtgatct tcacgtcctt gttgacgtgg 8340 

aaaatgacct tgttttgcag cgcctcgcgc gggattttct tgttgcgcgt ggtgaacagg 8400 

gcagagcggg ccgtgtcgtt tggcatcgct cgcatcgtgt ccggccacgg cgcaatatcg 8460 

aacaaggaaa gctgcatttc cttgatctgc tgcttcgtgt gtttcagcaa cgcggcctgc 8520 

ttggcctcgc tgacctgttt tgccaggtcc tcgccggcgg tttttcgctt cttggtcgtc S580 
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ctcaacacga 


ttttacgtca 


cttaaaaaac 


agccgggcag 


tgacgtcatc 


gtctgcgcgg 


ctaaatcgcg 


ccagcgctgg 


ctgttttacg 


cgcacgtatt 


cggtgaacgc 


actatggcga 


cctttgacgt 


ggtgatatgg 


atgacggatg 


gaaagctgca 


cgtaatcagc 


aagcgatata 


tgaggcagca 


cctggcacgg 


ctgggacgga 


tgcatgacaa 


agtcatcggg 


cattatctga 


ttacccaatt 


atgatagaat 


ttacaagcta 


gtccatgcaa 


gtttttatgc 


tttgcccatt 


tgccttgccc 


cctgaaatcc 


ttacatacgg 


ttatcagtat 


tgtcaatata 


ttcaaggcaa 


tcttggtagc 


tttttaaata 


tggcgcttca 


ttttcatacc 


tcggtataat 


cttacctatc 


ccccgaacac 


gagcacggca 


cccgcgacca 


ccggccccgc 


catgaagtcc 


gtgaatgccc 


ccaggccgcc 


gccctcactg 


cccggcacct 


gcacgtcaat 


gcttccgggc 


gtcgcgctcg 


tcccggcaat 


ggcaaggact 


gccagcgctg 


aggggcgcag 


cccctggggg 


gatgggaggc 


gggggggcac 


cccccttcgg 


cgtgcgcggt 


acaaggttta 


taaatattgg 


tttaaaagca 


aacgggcgga 


aacccttgca 


aatgctggat 


aataggtgcg 


cccctcatct 


gtcagcactc 


catctgtcag 


tagtcgcgcc 


cctcaagtgt 


gtccacatca 


tctgtgggaa 


actcgcgtaa 


ggccagctcc 


acgtcgccgg 


ccgaaatcga 


ggtgagtcgg 


cccctcaagt 


gtcaacgtcc 


tccgcgaggt 


atccacaacg 


ccggcggccg 


tctggcgcgt 


ttgcagggcc 


atagacggcc 
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tcaggccgca 


gtcggtaacc 


tcgcgcatac 


10500 


aaatggacga 


acagtggggc 


tatgtcgggg 


10560 


cgtatgacag 


tctccggaag 


acggttgttg 


10620 


cgctggggcg 


tct tat gage 


ctgctgtcac 


10680 


gctggccgct 


gtatgaatcc 


cgectgaagg 


10740 


cgcagcgaat 


tgageggcat 


aacctgaatc 


10800 


agtcgctgtc 


gttctcaaaa 


tcggtggagc 


10860 


acataaaaca 


ctatcaataa 


gttggagtca 


10920 


taaggttatt 


gtcctgggtt 


tcaagcatta 


10980 


ctatagatat 


attgataagc 


gcgctgccta 


11040 


cgatatcttc 


tatataaaag 


atatattatc 


11100 


tctgcctcct 


catcctcttc 


atcctcttcg 


11160 


tagagtaatt 


ctgtaaaggt 


ccaattctcg 


11220 


acctcaaatg 


gttcgctggg 


tttategcac 


11280 


ctatgccaag 


aatgcccaag 


gtaaaaattg 


11340 


cgacggccga 


agtgaagggc 


aggccgccac 


11400 


ggtcgctgaa 


tgtcgatgee 


agcacctgcg 


11460 


ggctgatcgc 


ccatcccgtt 


actgccccga 


11520 


ccatttttgg 


ggtgaggccg 


ttcgcggccg 


11580 


ccgcgttagc 


gggcegggag 


ggttcgagaa 


11640 


cacgcgcaca 


gggegeagee 


ctggttaaaa 


11700 


ggttaaaaga 


caggttagcg 


gtggccgaaa 


11760 


tttctgcctg 


tggacagccc 


ctcaaatgtc 


11820 


tgcccctcaa 


gtgtcaagga 


tcgcgcccct 


11880 


caataccgca 


gggcacttat 


ccccaggctt 


11940 


aatcaggcgt 


tttcgecgat 


ttgegagget 


12000 


gcctgcccct 


catctgtcaa 


cgccgcgccg 


12060 


gcccctcatc 


tgtcagtgag 


ggccaagttt 


12120 


cggtgtctcg 


cacacggctt 


egaeggegtt 


12180 


gccagcccag 


eggegaggge 


aaccagcccg 


12240 
12241 
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